IO  \ 


ft 


MRC  Technical  Summary  Report  # 1700 

CONTROLLABILITY  AND  STABILIZABILITY 
THEORY  FOR  LINEAR  PARTIAL 
DIFFERENTIAL  EQUATIONS:  RECENT 
PROGRESS  AND  OPEN  QUESTIONS 

DAVID  L.  RUSSELL 


Mathematics  Research  Center 
University  of  Wisconsin-Madison 
610  Walnut  Street 
Madison,  Wisconsin  53706 

November  197  6 
(Received  August  4,  1976) 


/ / 
J 


1 ^ I 

d 1 


Dp  C 


JAN  18 


Approved  for  public  release 
Distribution  unlimited 


Sponsored  by 

U.  S.  Army  Research  Office 
P.  O.  Box  12211 
Research  Triangle  Park 
North  Carolina  27709 


and 


Office  of  Naval  Research 
Arlington,  Va.  22217 


UNCLASSIFIED 


IICukity  Claudication  of  this  page  rWTi.n  o«i«  tnnnJJ 


READ  INSTRUCTIONS 


REPORT  DOCUMENTATION  PAGE 


BEFORE  COMPLETING  FORM 

» RECIPIENT’*  CATALOG  NUMBER 


2 OOVT  ACCCIIION  NO 


s type  of  report  a period  covered 

Summary  Report  - no  specific 
reporting  period 

6 performing  org  report  number 


title  t tnd  Submit) 


CONTROLLABILITY  AND  STABILIZABILITY  THEORY! 
FOR  LINEAR  PARTIAL  DIFFERENTIAL  EQUATIONS: 
RECENT  PROGRESS  AND  OPEN  QUESTIONS.  rT 


AU  ThOWi) 


DAAG29  -7  5 -C  -,$#24 


David  L 


10  PROGRAM  ELEMENT.  PROJECT.  TASK 


9 performing  organisation  NAME  and  address 

Mathematics  Research  Center,  University  of 
610  Walnut  Street  Wisconsin 

537  06 


AREA  a WORK  UNIT  NUMBERS 


Madison.  Wisconsin 


M CONTROLLING  gfpicE  NAME  AND  AODRE5S 


See  item  18  below 


15  SECURITY  CLASS  ( ol  Ihlt  rtporl ) 


MONITORING  AGENCY  NAME  a AODRESSffl  dltlttmnt  from  Controlling  Ofllct) 


UNCLASSIFIED 


15-*  DECLASSl  F|  CATION /DOWN  GRADING 
SCHEDULE 


16  DISTRIBUTION  STATEMENT  (oi  th!  a Report) 


Approved  for  public  release;  distribution  unlimited 


17  DISTRIBUTION  STATEMENT  (of  the  •bflrtrel  entered.  Lit. Rio  ck  20,  It  d///«r«nf  from  Report) 


SUPPLEMENTARY  NOTES 

U.  S.  Army  Research  Office 
P.  O.  Box  12211 
Research  Triangle  Park 


Office  of  Naval  Research 
Arlington,  Va.  22217 


North  Caro 


19  KEY  WORDS  ( Continue  on  reverse  aide  II  neceeemry  and  Identity  by  block  number) 


Partial  differential  equations,  control,  observation,  stability,  hyperbolic 
parabolic,  distributed  control  systems 


ABSTRACT  ( Continue  on  reveree  elde  II  neceeemry  mnd  Identity  by  block  number ) 


‘This  paper  is  an  assessment  of  the  current  state  of  controllability  and 


observability  theories  for  linear  partial  differential  equations,  summarizing 


existing  results  and  Indicating  open  problems  in  the  area 


form 

1 JAN  79 


EDITION  OF  1 NOV  65  IS  OBSOLETE 


UNCLASSIFIED  jX  (?<  (P\CA^ 

SECURITY  CLASSIFICATION  OF  THIS  PAGE  D«f«  Entered) 


UNIVERSITY  Or  WISCONSIN  - MADISON 
MATHEMATICS  RESEARCH  CENTER 

CONTROLLABILITY  AND  STABILIZABILITY  THEORY 
LINEAR  PARTIAL  DIFFERENTIAL  EQUATIONS; 

RECENT  PROGRESS  AND  OPEN  QUESTIONS1" 

David  L.  Russell* 

Technical  Summary  Report  #17  00 
November  197  6 

ABSTRACT 

This  paper  is  an  assessment  of  the  current  state  of 
controllability  and  observability  theories  for  linear  partial 
differential  equations,  summarizing  existing  results  and 
indicating  open  problems  in  the  area.  The  emphasis  is  placed 
on  hyperbolic  and  parabolic  systems.  Related  subjects  such  as 
spectral  determination,  control  of  nonlinear  equations,  linear 
quadratic  cost  criteria  and  time  optimal  control  are  also  discussed. 

AMS(MOS)  Subject  Classification  - 93BXX,  93CXX,  93DXX 

Key  Words  - Partial  differential  equations,  control,  observation, 
stability,  hyperbolic,  parabolic,  distributed  control 
systems 

Work  Unit  Number  1,  Applied  Analysis 


t Sponsored  in  part  by  the  Office  of  Naval  Research  under  Project 
NR  041-404  and  by  the  Department  of  the  Army  under  Contract  No. 
DAAG29 -7  5-C-0024.  Some  parts  of  this  article  were  written  with 
the  support  of  Institut  de  Recherche  d’Informatique  et  Automatique, 
Rocquencourt,  France  and  appear,  in  briefer  form,  in  "Seminaires 
IRIA". 

t Department  of  Mathematics,  University  of  Wisconsin,  Madison, 
WI,  53706. 


CONTROLLABILITY  AND  STABILIZABILITY  THEORY  FOR 
LINEAR  PARTIAL  DIFFERENTIAL  EQUATIONS; 

RECENT  PROGRESS  AND  OPEN  QUESTIONS* 

v David  L.  Russell’ 

1.  INTRODUCTORY  REMARKS 

We  have  taken  some  pains  in  devising  the  title  of  this  article  to 
avoid  committing  ourselves  to  the  impossible.  It  is  virtually  hopeless 
to  attempt  a complete  outline  of  the  development  of  control  theory  as  it 
relates  to  partial  differential  equations.  Comprehensive  bibliographies 
in  the  area  compiled  by  Johnson  [44]  and  Robinson  [85]  include  articles 
written  up  to  1971  and  list  several  hundred  titles.  It  is  safe  to  say  that 
a comparable  number  have  appeared  since.  Such  a diversity  of  con- 
cepts and  theories  leaves  one  in  some  confusion,  with  little  idea  as  to 
how  unity  is  to  be  imposed  upon  it.  No  such  unification  will  be  attempted 
here.  We  have  selected  one  part  of  the  whole  which,  with  admitted  bias, 
we  feel  to  have  some  importance  and  which  is  sufficiently  restricted  to 
permit  some  overall  pattern  to  be  discerned. 

Control  theory  as  a whole  is  divided  into  two  main  sub-categories. 

In  the  AMS(MOS)  Subject  Classification  Scheme  the  category  of  Optimal 
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Control  is  grouped  with  the  Calculus  of  Variations  in  Section  -49  while 
Section  93  is  devoted  to  "Systems,  Control".  Inevitably  there  is  some 
overlap  between  these  two  classifications  and  other  categories,  such 
as  Stochastic  Control,  e.g.,  could  be  singled  out  for  separate  listing 
if  desired.  But  this  basic  dichotomy  is  generally  recognized. 

Historically,  the  first  of  these  categories,  Calculus  of  Variations 
and  Optimal  Control,  developed  first  and,  within  the  mathematical  world 
at  least,  has  enjoyed  the  wider  publicity.  It  is,  in  fact,  not  uncommon 
to  hear  the  term  "optimal  control"  used  very  widely  without  apparent 
recognition  that  these  are  two  words,  the  first  very  much  modifying  the 
second.  This  is  the  lingering  effect  of  a strong  "first  impression"  - 
the  initial  amazement  of  the  mathematical  community  in  seeing  the 
subject  treated  seriously  by  no  less  a person  than  L.  S.  Pontrjagin  [7  6], 
[77]  with  a subsequent  revitalization  of  the  Calculus  of  Variations  on 
such  a scale  that  the  term  "renaissance"  is  not  wholly  inappropriate. 

In  this  respect  there  has  probably  been  less  confusion  on  the  part 
of  engineers,  for  whom  the  basic  problem  always  has  been  that  of 
understanding  and  modifying  the  behavior  of  systems  whose  dynamical 
equations  include  "control"  terms  under  the  influence  of  the  plant  op- 
erator - human  or  otherwise.  From  this  point  of  view  the  value  of  op- 
timization theory  and  procedure  is  well  recognized  but  is  not  seen  as 
the  ultimate  goal  of  the  system  analysis.  The  present  article  looks  at 
the  control  theory  of  partial  differential  equations,  or  "distributed 


parameter  systems",  from  a viewpoint  akin  to  this.  We  shall  be  con- 
cerned primarily  with  controllability,  observability,  stabilizability  and 


r 

i 

general  system  behavior  modification  for  certain  distributed  parameter 

« 

systems.  We  will  attempt  to  outline  the  theory  as  it  has  developed  in 
the  last  two  decades  and  indicate  some  significant  problems  which  still 
remain  unresolved.  It  is  unfortunate,  but  almost  inevitable,  that  the 
| v article  will  emphasize  work  either  done  by  this  author  or  strongly  related 

to  his  work.  For  this  one  can  only  offer  the  excuse  that  treatment  of 
material  with  which  the  author  has  not  been  intimately  involved  would 
of  necessity  lack  the  insight  th$t  one  expects  in  this  sort  of  article. 

While  the  area  of  controllability,  observability,  stabilizability 
and  system  modification  is  the  field  of  control  theory  traditionally  im- 
portant in  the  engineering  profession,  it  is  nevertheless  true  that,  as  a 
modern  subject,  its  development  can  be  traced  in  large  part  to  the 
emergence  of  optimal  control  theory.  For  the  most  part  (the  classical 
Wiener  filter  theory  is  an  important  exception  [74],  [110])  optimal  control 
theory  was  developed  in  the  "state  space"  setting  rather  than  in  the 
"frequency  domain"  setting  traditional  in  engineering.  This  meant  that 
those  who  wished  to  take  advantage  of  the  impressive  developments  in 
optimal  control  were  forced  to  translate  over  into  the  state  space  frame- 
work those  aspects  of  system  behavior  which  they  already  knew  in  the 
frequency  domain.  That  "nothing  was  lost  in  translation"  is  still  a 
matter  of  some  dispute  ( [4 Z ])  but  that  much  was  gained  can  hardly  be 
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denied.  A rich  structure  relating  control,  observation,  stabilization, 
filtering,  spectral  determination,  and  many  other  notions  emerged  in  a 
very  short  time.  Perhaps  best  known  are  the  contributions  of  Kalman 
and  Kalman  and  Bucy  [47],  [48]  which  have  subsequently  wholly  re- 
volutionized the  techniques  of  control  system  design.  These  were  ideas 
whose  time  had  come.  In  fact,  since  the  mathematics  involved  is  not 
particularly  exotic  for  the  most  part,  one  could  argue  that  these  were 
ideas  which  were  overdue  - which  helps  to  explain  their  very  rapid 
evolution  once  the  initial  steps  (whose  importance  we  would  not  minimize) 
had  been  taken. 

The  control  theory  of  partial  differential  equations  has  followed 
right  on  the  heels  of  that  for  ordinary  differential  equations,  but  with 
slower  and  heavier  tread.  Concerning  the  individual  steps  we  shall 
have  more  to  say  in  the  sequel.  Here  we  wish  to  remark,  with  some 
mystification  and  regret,  on  the  historical  and  persistent  neglect  of 
notions  "of  controllability  type"  on  the  part  of  most  mathematicians 
working  with  partial  differential  equations.  Here  again  we  have  an 
area  of  mathematics  which  is  somewhat  "overdue"  in  its  development. 

It  seems  possible  to  the  author  that  this  state  of  affairs  is  in  part  at- 
tributable to  the  persistent  influence  of  Hadamard's  definition  of  a 
"well-posed  problem".  (Perhaps  with  some  historians  we  can  see  that 
every  great  act  of  creation  has  a tendency  to  foster  limitations  as  well 
as  possibilities.  ) Given  an  equation 
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L(u,  f,  x,  t)  = 0 , 


V 


initial  conditions 

J(u( - , 0),  g)  = 0 

and  boundary  conditions  (or  other  side  conditions) 

B(u,  h)  = 0 . 

with  f,  g and  h known  functions,  well-posedness  requires  that  the 
solution  u = w(x,  t)  should  behave  in  a suitably  "regular"  manner  as 
f,  g and  h are  varied.  With  this  point  of  view  the  emphasis  is  placed 
upon  establishing  that  the  influence  of  f,  g and  h on  u is  "not  too 
great"  - modest  changes  in  f,  g and  h producing  correspondingly 
modest  changes  in  u . WTien  f,  g and/or  h are  regarded  as  controls, 
however,  the  emphasis  shifts  and  we  want  to  know  that  the  influence 
of  the  control  functions  on  u is  "not  too  little"  - that  a given  change 
in  w can  indeed  be  realized  with  an  appropriate  change  in  the  control 
function.  Thus  we  have  diametrically  opposite  emphasis  in  control 
theory  from  that  which  has  historically  motivated  those  working  with 
partial  differential  equations. 

It  will  be  seen  in  the  sequel  that  our  subject,  particularly  from 
the  point  of  view  of  observation  theory,  has  some  points  in  common 
with  the  relatively  new  field  of  ill -posed  problems.  This  is  explicitly 
recognized,  e.  g.  , by  Seidman  in  [10Z].  The  recent  interest  in  this  field 
together  with  the  developments  which  we  describe  represent  at  least  in 
part  "the  other  side  of  the  coin"  relative  to  the  historically  dominant 


emphasis  on  well-posedness  and  regularity.  One  may  hope  for  the  de- 
velopment of  general  recognition  of  the  complementarity  of  the  two  points 
of  view.  To  the  degree  that  this  article  assists  in  encouraging  such  an 
attitude  and  in  stimulating  interest  in  these  matters  generally  we  will 
count  our  present  effort  a success. 


2.  BASIC  CONCEPTS 


While  most  of  this  paper  will  deal  with  "concrete"  control  systems 
described  by  comparatively  simple  linear  partial  differential  equations, 
it  is  nevertheless  useful  to  introduce  some  fundamental  ideas  in  a more 
general  setting.  We  will  begin  by  reviewing,  very  briefly,  the  funda- 
mental notices  of  control  theory  as  they  apply  to  finite  dimensional  linear 
constant  coefficient  systems  and  will  then  indicate  how  these  notions 
may  be  extended  to  systems  in  HilbeYt  spaces.  (Most  ideas  extend 
also  to  Banach  spaces  [15],  [16]  but  we  restrict  attention  to  Hilbert 
spaces  in  this  paper  to  avoid  less  essential  difficulties.  ) 

We  begin  with  consideration  of  the  linear  constant  coefficient 
control  system 

x = Ax  + Bu,  x € En,  u e Em  (2.  1) 

with  A,  B matrices  of  appropriate  dimension.  We  will  be  concerned 
with  application  of  control  functions  u « L2([0,  T];  Em)  where  T is  a 
positive  number. 

Definition  2. 1.  The  system  (2.  1)  is  controllable  (in  time  T > 0)  if 

and  only  if,  given  points  xQ,  x^  £ En,  there  is  a control  u € L2([0,  T];Em) 

such  that  the  solution  of  (2.1)  which  satisfies 

x(0)  = xQ  (2.2) 

also  satisfies 

x(T)  = Xj  . (2.  3) 
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It  is  convenient  to  use  the  terminology  “u  steers  xQ  to  x^ 

(during  [ 0 , T 1 > ' ' • It  is  well  known  [K]  (and  we  do  not  repeat  the  proof 
here)  that  (2.  1)  is  controllable  just  in  case 

rank[B,  AB,  A^B,  a"*1  B]  = n . (2.4) 

As  this  condition  is  independent  of  T we  can  see  that  controllability 
of  (2.  1)  is  a property  which  either  holds  or  does  not  hold  for  all  intervals 
[0,  T],  T > 0,  and  we  may  speak  of  (2.1)  as  being  "controllable"  without 
reference  to  any  particular  interval.  This  property  does  not  generally 
extend  to  systems  x = A(t)x  + B(t)u  with  time  varying  coefficients  and, 
as  we  shall  see  does  not  extend  to  certain  partial  differential  equations 
even  if  we  do  have  constant  coefficients. 

Paired  with  (2.  1)  is  the  linear  observed  system 


y = Cy,  ye  e"  , (2.  5) 

w = Hy,  w € Er  , (2.  6) 

which  is  "dual"  to  (2. 1)  just  in  case  r = m and 

C = -A"  (2.  7) 

H = B*  . (2.  8) 


Definition  2.2.  The  linear  observed  system  (2.  5),  (2.6)  is  observable 

(in  time  T > 0 ) just  in  case  the  following  is  true;  whenever  y is  a 

non-zero  solution  of  (2.  5)  then  the  "observation"  w,  determined  by 

Z r 

(2.  6),  is  not  the  zero  element  of  L ([0,  T];  E ) . 
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One  can  show  rather  easily  [St] that  (2.  5),  (2.6)  is  observable 
(on  an  arbitrary  interval  [0,T],  T > 0)  just  in  case 


H 

HC 


rank 


HC 


HC 


n-1 


= n 


(2.  9) 


When  r = m and  (2.7),  (2.8)  are  true  the  rank  conditions  (2.4),  (2.9) 
are  evidently  equivalent  and  we  have 

Proposition  2.  3.  The  control  system  (2.1)  is  controllable  if  and  only 
if  the  dual  linear  observed  system  (2.  5),  (2.6),  (2.7),  (2.  8)  is  observable. 

It  is  known  ([47],  [60],  [103],  [105])  that  controllability  of  (2.1) 
is  equivalent  to  the  matrix 

T 


Z(T>  = / eA,T-'»  bb'  eA  «T-'»dt 
0 


(2.10) 


being  positive  definite  for  every  T > 0 and  observability  of  (2.  5),  (2.  6) 
is  equivalent  to 


T # 

W(T)  = f e°  1 H*H  eCt  dt 
" 0 


(2.11) 


being  positive  definite  for  every  T > 0 . If  Z(T),  given  by  (2.10),  is 
positive  definite,  a control  u satisfying  the  requirements  of  Definition 
2.  1 can  be  given  explicitly  ( [G ])  by 


* A (T-ti  - 1 AT 

U(t)  = B e 1 ’ Z(T)  (Xj-e  xQ),  tt  [0,T] 


(2.12) 
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If  W(T),  given  by  (2.  II),  is  positive  definite  the  solution  y(t)  of 
(2.  5)  can  be  recovered  from  the  observation  cu(t)  via 

T * 

/'-t  f i ^ 

y(t)  = e W(T)“  f e H uj(s)ds,  tc  [0,  T]  . 

0 

A very  elegant  formulation  of  controllability,  observability,  and  the 
duality  relationship  between  the  two,  has  been  set  forth  by  S.  Dolecki 
[15],  [16],  The  basic  idea  was  presented  by  Dolecki  in  an  unpuDlished 
report  and  was  later  incorporated  with  work  of  the  present  author  in  the 
joint  paper  [16].  While  [15],  [16]  treat  the  general  problem  in  Banach 
space  we  here  consider  only  the  Hilbert  space  theory  for  reasons  noted 
above. 

We  let  X,  Y,  Z be  Hilbert  spaces  (over  the  same  field,  ordinarily 
the  real  numbers  or  the  complex  numbers)  and  we  let  S:  X -*  Z be  a 
bounded  linear  operator  and  C:  &(C)  (domain  of  C)  C Y -*  Z a linear 
operator  with  £(C)  dense  in  Y.  Then  {X,  Y,  Z,  S,  C]  together  constitute 
an  abstract  linear  control  system  which  we  set  forth  diagrammatically  in 
Fig.  2.  1 

Y D S(C) > Z 

X 

Figure  2.1.  Abstract  linear  control  system 
Definition  2.4.  The  abstract  linear  control  system  {X,  Y,  Z,  S,  C}  is 
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(i)  controllable  if  ^ (C)  _>  (S)  ; 


(ii)  approximately  controllable  if  ^(C)  -^^(S)  . 

Some  intermediate  notions  can  also  be  introduced  but  these  will  suffice 
for  the  moment. 

To  indicate  the  application  of  this  theory  in  a setting  slightly  more 
general  than  (2.  1),  let  A in 

z = Az  + Bu  (2.13) 

At 

now  be  the  generator  of  a semigroup  of  bounded  operators,  e , t > 0 , 

on  the  Hilbert  space  Z . Let  the  control  u lie  in  a Hilbert  space  U 

2 

and  let  B:  U -*  Z be  a bounded  linear  operator.  Let  Y = L ([0,  T];  U)  , 

X = Z 0 Z . From  the  variation  of  parameters  formula 

z(t)  = eAt  z(0)  + f e ^ Bu(s)ds 

0 

we  see  that  u steers  zQ  to  z ^ during  [0,  T]  just  in  case 

T 

AT  r A(T-s)  , 

z,  - e zn  = e Bu( s)ds  . (2.14) 

1 0 0 

This  prompts  the  definitions; 

AT 

S:  X(=  Z ©Z)  - Z;  S(zQ,  z^  = z1  - e zQ  ; 

C:  Y(=  L2([0,  T];  U))  - Z:  Cu  = f eA(T"S)  Bu(s)ds  . (2.15) 

' 0 

If  (i)  of  Def.  2.4  holds,  there  is  a u for  which  Cu  = S(zQ,  z^)  . Then 
(2.  14)  obtains  and  the  control  u steers  zQ  to  z^  during  [0,  T]  . 


-11- 


In  the  case  of  finite  dimensional  systems  (2. 1)  with  S,  C constructed 

just  as  above,  (i)  and  (ii)  of  Definition  2.4  are  equivalent  since  a dense 
subspace  of  E must  be  E itself.  But  in  infinite  dimensional  spaces 
there  is  a genuine  distinction  between  controllability  and  approximate 
controllability.  If  (ii)  is  true,  then  for  every  £ > 0 we  can  find  u 

e 

such  that 


Cu2  ' S<zo’Zl,llZ<  £ ’ 


which,  returning  to  (2.  14),  means  there  is  a vector  z e Z with 

e 

II  z II  < e such  that 
e Z 


AT 

e zQ  + 


/ eA^T  Bu  (s)ds  = z + z , 
n e 1 e ’ 


i.e.  u steers  z to  a point  z + z within  a distance  e of  z . 

e u i e l 

This  is  the  accepted  meaning  of  approximate  controllability  [21]. 

It  is  clear  from  Definition  2.  4 that  we  obtain  the  same  property 
if  S is  replaced  by  another  operator  Sj  with  the  same  range.  Thus  in 
the  example  above  we  could  take  X = Z and  Sj  the  identity  on  Z and 
controllability  of  { Z,  Y,  Z,  Sj,C}  would  amount  to  the  same  thing  as 
controllability  of  (Z  © Z,  Y,  Z,  S,  C}  . However,  if  we  take  X = Z 


and  let  S.z  = - 


AT 


\zq  ~ _e  zq>  z0  € Z,  then  q(C)  D ^(S^)  means  only  that 

T 

+ r 

0 


„ AT  r 

O = e z + | 
0 J 


eA(T's)  Bu(s)ds 


can  be  solved  for  each  zQ  « Z,  which  means  that  each  initial  state  can 
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be  steered  to  zero.  This  is  not  the  same  as  our  earlier  property  unless 
AT  ■ • 

e is  invertible. 

The  meaning  of  approximate  controllability  also  varies,  depending 

on  our  choice  of  the  operator  Sj  . With  S equal  to  the  identity  the 

condition  3(C)  D 3 (S^)  means  the  system  can  be  steered  from  a zero 

initial  state  to  an  arbitrarily  small  neighborhood  of  any  desired  terminal 

state  (or,  equivalently  from  any  initial  state  to  an  arbitrarily  small 

AT 

neighborhood  of  any  desired  terminal  state).  With  = -e  the  con- 
dition 3 (C)  D 3 (Sj)  means  that  the  system  can  be  steered  from  an 

arbitrary  initial  state  to  an  arbitrarily  small  neighborhood  of  any  terminal 

AT  AT 

state  which  lies  in  the  range  of  e .In  most  cases  the  range  of  e 

is  dense  in  Z and  the  two  mean  the  same  thing.  But  there  are  cases 

AT 

where  the  range  of  e is  not  dense  in  Z and  then  the  two  types  of 
approximate  controllability  are  different. 

By  using  various  operators  S,  C one  can  discuss  a variety  of 
controllability  concepts.  Consider,  for  example,  a slight  modification 
of  the  above 

z = Az  + Bu  + Dv,  z(0)  = zQ  € z , 

with  the  "disturbance"  v in  a Hilbert  space  V . We  suppose  that  we 
have  an  observation 

co  = Hz,  H:  Z -*  W (2.16) 


where  W is  aiso  a Hilbert  space.  This  time  we  let 


■ 


X = Z © L2([0,  T];  V) 

Ze  = L2([e,T];  W),  e > 0 , 

and  define  S:  X — Z by  setting 
e 

(S(z 0,  v))(t)  = H eAt  z + H f eA(t'S)  Dv(s)ds,  te[e,T]  . 

0 

We  let  Y = L2(f 0,  T];  U)  and  define  C;  Y -*  Z by 

e 

(Cu)(t)  = H / eA(Us)  Bu(s)ds,  t « [e,  T]  . 

0 

Controllability  of  {X,  Y,  Z^,  S,  C },  i.  e.  ft(C)  D ^(S),  now  corresponds 

to  the  property  of  being  able  to  reduce  the  observation  (2. 16)  to  zero  on 

2 

[e,T]  with  an  appropriate  control  u € Y = L ([0,  T];  U),  whatever  the 

2 

initial  state  £ Z and  disturbance  function  ve  L ([0,  T];  V)  . 

One  point  which  should  be  made  is  this.  As  with  meat  and  poison, 
one  man's  controllability  (sometimes  called  exact  controllability)  is 
another's  approximate  controllability.  If  5?(Sj)  Dft(S)  and 
fX,Y,Z,Sj,C}  is  controllable,  then  {X,  Y,Z,S,C}  is  approximately 
controllable.  It  frequently  occurs,  however,  that  we  can  show 
{X,  Y,  Z,  S,  C}  approximately  controllable  without  being  able  to  identify 
any  reasonably  "concrete"  operator  Sj  for  which  {X,  Y,  Z,  S^C}  is 
(exactly)  controllable. 

It  is  fairly  well  known  that  control  and  observation  are  "dual  to 
each  other".  The  abstract  framework  permits  a rigorous  presentation  of 
this  duality  relationship.  Since  S and  C of  Definition  2. 4 have  been 
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i 


assumed  bounded  and  with  dense  domain,  respectively,  the  adjoint 
operators  S : Z -*  X and  C = fi(C  ) C Z -*■  Y may  be  defined  by 

(S*  z,  x)  = (z,  Sx)z,  x € X,  z e Z , 

(C*z,  y)y  = (z,  Cy>z,  ye  &(C),  ze  £>(C*)  , 

the  domain  &(C  ),  of  C being  the  set  of  ze  Z for  which  (z,  Cy)z 

can  be  extended  to  a bounded  linear  functional  on  Y . We  present  this 
diagrammatically  in  Fig.  2.2 


* 


Figure  2.2.  Abstract  linear  observed  system 

and  refer  to  {C  , S , Z,  Y,  X}  as  an  abstract  linear  observed  system. 

Definition  2.  5.  The  abstract  linear  observed  system  {C  ,S  , Z,  Y,  X}  is 
(i)  distinguishable  if  ker  C C ker  S ; 

(if)  observable  if,  in  addition  to  (i),  there  is  a positive  number  K 
such  that 

llc*z||Y  > K||s*z|lx,  z € fi(C*)  . 

The  linear  observed  system  dual  to  (2.13)  is 

. * 

; = -A  ; 

* 
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(2.17) 

(2. 18) 


fe 


If  we  choose  X = Z,  S = identity  on  Z,  then  S*  is  likewise  the 


identity  on  Z.  If  we  define  C by  (2.  15)  then  C*:  Z - Y = LZ([0,  T];  U) 


is  given  by 


* * A it 

(C  Zj)(t)  = u(t)  = B e U c Z . 


(2.19) 


We  may  regard  t,  « Z as  supplying  a terminal  condition  for  (2.17) 

* -A*t 

(-A  generates  a semigroup  of  bounded  operators  e for  t < 0)  , 


the  terminal  condition  being  given  at  t = T . In  this  setting  (i)  above 


merely  says  that  if  ^ * 0 then  (C  ; )(t)  = w(t)  is  not  zero  in 


L ([0,  T];  U),  which  in  turn  implies  that  different  terminal  states  E,  , £ 


for  (2. 17)  yield,  via  (2.18),  different  observations  to(t),  C(t),  re- 


spectively. Hence  the  term  "distinguishability" . On  the  other  hand  (ii) 


goes  further  to  say  that 


lc\l  2 

L ([0,  T] ; U) 


> k||; 


l z * 


This  is  much  stronger  since  it  implies  the  existence  of  a bounded 


"reconstruction"  operator  R = L (fO,  T];  U)  - Z for  which 


Rw  = RC  = Z,v  ^ € Z , 


so  that  the  state  £ can  be  continuously  reconstructed  from  the  ob- 


servation cj  = C «;  . 


In  [16]  the  following  theorems  are  proved. 


Theorem  2.  6.  The  abstract  linear  control  system  (X,  Y,  Z,  S,  C]  is 


approximately  controllable  if  and  only  if  the  abstract  linear  observed 


7f 

system  f C , S , Z,  Y,  X } is  distinguishable. 


Theorem  2.7.  The  abstract  linear  control  system  {X,  Y,  Z,  S,  C } is 


(exactly)  controllable  if  and  only  if  the  abstract  linear  observed  system 
{ C , S , Z,  Y,  X } is  observable. 

We  remark  that  most  controllability  results  for  linear  systems  are 
proved  by  a method  which,  in  direct  application,  amounts  to  establishing 
the  corresponding  observability  property  for  the  dual  system.  This  pattern 
will  be  observed  repeatedly  in  the  sequel  but  some  exceptions  will  also 
be  noted. 

Perhaps  the  most  important  "message"  of  the  foregoing  abstract 
treatment  is  that  controllability  and  observability  questions  are  familiar 
questions  of  solvability  of  linear  equations  in  function  space.  Theorems 
2.6  and  2.7,  in  fact,  are  just  paraphrased  versions  of  the  standard 
functional  analysis  theorems  which  relate  the  range  of  an  operator  to 
the  null  space  of  its  adjoint  (see,  e.  g.  [17]).  Nevertheless,  the 
questions  we  ask,  and  the  case  we  make  of  the  answers  are  sufficiently 
distinct  and  particular  that  we  can  legitimately  regard  our  subject  as  a 
separate  mathematical  area  in  its  own  right. 

One  of  the  main  concerns  of  control  theory  is  the  relationship 
between  controllability  and  stabilizability.  For  the  finite  dimensional 
system  (2. 1)  we  have 

Definition  2.8.  The  control  system  (2.1)  is  stabilizable  if  there  exists 
an  m X n matrix  K such  that  the  linear  feedback  control  law 

u = Kx  , 
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substituted  in  (2.  1),  yields  a "closed  loop"  system 


x = (A  + BK)x  (2.  2 0) 

in  which  A + BK  is  a stability  matrix,  i.  e.  , all  of  its  eigenvalues  have 
negative  real  parts. 

The  relationship  between  controllability  and  observability  for 
finite  dimensional  systems  is  well  known.  We  have 
Theorem  2.  9.  If_  (2.  1)  is  controllable  then  it  is  also  stabilizable; 
moreover,  it  is  possible  to  prescribe  any  complex  numbers  X^,  X^,  X 

as  the  eigenvalues  of  the  closed  loop  matrix  A + BK  by  appropriate  choice 
of  the  feedback  matrix  K . 

Perhaps  the  nicest  proof  of  the  first  part  of  the  theorem  is  due, 
independently,  to  Lukes  [60]  and  Kleinman  [49]  who  showed  that  if  (2.1) 
is  controllable  then 

K=-B *(/  e'At  BB*  e'A  tdt)'1,  T>0  , 

0 

* AT  -1  AT 

= (cf.  (2.  10))  = - B e 1 Z(T)  e 1 (2. 21) 

yields  a closed  loop  matrix 

$ 

* A T -1  AT 
A+  BK  = A - BB  e Z(T)  e 

whose  eigenvalues  have  negative  real  parts.  The  second  part  of  the 
theorem  is  established  by  reducing  (2.  1)  to  control  canonical  form  (see 
[55],  [111]).  We  shall  carry  this  out  in  Section  4. 

It  is  not  in  general  true  that  if  (2. 1)  is  stabilizable  then  it  is 
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controllable.  One  need  only  take  A to  be  a stability  matrix  at  the  outset 
and  B = 0 for  a counter-example.  However,  the  "pole  placement  property", 
i.  e.  , being  able  to  place  the  eigenvalues  X^,  X.  , . . . , of  the  closed 
loop  matrix  A + BK  at  will  in  the  complex  plane  does  imply  controllability. 
In  fact,  a somewhat  weaker  condition  implies  controllability: 

Theorem  2.  10.  If  (2.1)  can  be  stabilized  both  forward  and  backward,  i.e. 
if  feedback  matrices  K,  K can  be  found  such  that  A + BK  has  only 

A 

eigenvalues  with  negative  real  parts  while  A + BK  has  only  eigenvalues 
with  positive  real  parts,  then  (2. 1)  is  controllable. 

Theorem  2.10  is  not  very  significant  in  connection  with  finite 
dimensional  systems  since  even  more  is  true.  If  no  eigenvalue  of 
A + BK  is  invariant  with  respect  to  all  admissible  feedback  matrices  K 
then  (2.1)  is  controllable.  But  Theorem  2.10  is  important  in  the  sense  that 
its  extension  to  infinite  dimensional  systems  sometimes  provides  the 
only  means  available  for  establishing  controllability.  This  is  particularly 
true  for  certain  hyperbolic  systems  to  be  discussed  in  Section  5.  Theorem 
2.10  is  proved  in  [93]. 

For  finite  dimensional  systems  the  property  that  all  solutions 
x(t)  of  (2.20)  tend  to  zero  as  t -*  +oo  is  equivalent  to  the  property  of 
all  eigenvalues  of  A + BK  having  negative  real  parts,  which  in  turn  can 


be  used  to  show  that 


I (A+BK)t  |i  w -vt 
e ||  < M e , t > 0 


J 
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for  certain  positive  constants  M,  y,  depending  only  on  the  matrix 
A + BK  . Thus,  under  such  circumstances,  solutions  of  (2.20)  have  the 
property  of  uniform  exponential  decay; 

ll  x(t)  ||  < Me'Vt  ||  x(0)  ||  . (2.22) 

For  infinite  dimensional  systems  the  situation  is  not  as  simple.  It  is 
possible  to  have  all  solution c of  the  system  tending  to  zero  without  any 
uniform  decay  rate  such  as  (2.22)  applying.  Perhaps  not  surprizingly, 
controllability  in  the  sense  of  (i)  of  Definition  2.4  can  be  associated 
with  uniform  exponential  decay  in  certain  distributed  parameter  systems 
while  approximate  controllability,  (ii)  of  Definition  2.4,  can  be  associated 
only  with  the  weaker  notion  of  all  solutions  tending  to  zero  as  t — +^o  . 
The  results  thus  far  obtained  along  these  lines  pertain,  for  the  most 
part,  to  specific  types  of  systems  and  are  left  for  treatment  in  later 
sections  of  this  paper. 
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5.  linear  symmetric  hyperbolic  systems 

Perhaps  the  simplest  of  distributed  parameter  systems  are  those 
described  by  linear  symmetric  hyperbolic  systems  of  first  order  partial 
differential  equations  in  two  variables  t (time)  and  x (a  single  space 
variable).  They  have  the  additional  advantage,  from  the  didactic  view- 
point, of  possessing  sufficient  complexity  to  permit  exposition  of  a 
wide  variety  of  questions  which  are  of  interest  to  those  working  with 
distributed  parameter  control. 

We  consider  the  system 

- A(x)  + B(x)w,  w c En,  t > 0,  0 < x < 1 , (3.  1) 

3t  ox 

wherein  A(x)  is  a continuously  differentiable  real  symmetric  n xn 
matrix  function  and  B(x)  is  a real  continuous  n Xn  matrix  function 
for  0 < x < 1 . We  further  suppose  that  the  eigenvalues  \k(x)  of  A(x) 
satisfy  \j(x)  < \.,(x)  < . . . < Xp(x)  < 0 < Xp+1(x)  < Xp+2(x)  £ • • • i Xp+Q(X)  ’ 
0 < x < 1 (p+q  = n);  and:  (l  x £ [0,  1]  3 X^fx)  = Xk+^(x)}  =»  fvx  £ [0,  1], 

= Xk+i<x^  • 

A lemma  proved  by  Phillips  [7  5]  allows  us  to  reduce  (3.1)  to  a 
convenient  standard  form.  There  it  is  shown  that  there  is  a continuously 
differentiable  orthogonal  n x n matrix  function  O(x)  such  that 

O(x)"1  A (x)  O(x)  = 0(x)T  A (x)  O(x)  a A(x) 

is  diagonal: 
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(3.2) 


A(x)  = diag(Xj(x),  X^x),  . ..,  X^x))  . 


The  change  of  variable 


w(x,  t)  = 0(x)  w(x,  t) 


in  (3.1)  gives  the  equation  in  standard  form 


dw  d w 

— = A(x)  — + B(x)w,  t > 0,  0 < x < 1 , 


(3.  3) 


with  A(x)  as  indicated  and 

B(x)  = 0(x)T  [B  (x)  O(x)  + A (x)  O'(x)]  . 

We  note  that  A(x)  is  still  continuously  differentiable  and  B(x)  is  still 
continuous. 

Now  A(x)  can  be  written 


A(x)  = 


rA_(x)  O \ A"(x)  = diagJX^x),  . . . , Xp(x))  , 

+ r + 

o A (x)  j A (X)  = diag(X  (x),  . . . , Xn+a(x))  , 


(3.4) 


p+q 


and  w can  be  partitioned  as 


w , 

+ ^q 

w = | l , w € E , w « E 
■f" 
w 


(3.  5) 


The  "characteristics"  corresponding  to  the  eigenvalue  X^(x)  are  solutions 
of  the  differential  equation 

dx 

3?  - -\fx>  ■ 


These  are  denoted  generally  by  c , or  more  specifically  by  c (x,  t) 

K K 
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if  we  wish  to  specify  the  characteristic  which  passes  through  the  point 

k 

(x,  t)  . On  c the  k-th  component,  w ',  of  w satisfies  the  linear 
k 

ordinary  differential  equation 

' k 9wk 

^ [wk(x(t),  t)]  H (x(t),  t)  - Xk(x(t))  — (X(t),t) 

BK(x(t))  w(x(t),  t),  k = 1, 2,  . . . , n , (3.6) 

where  B'‘  denotes  the  k-th  row  of  B . These  n equations  are,  in 

general,  coupled  by  the  terms  B ‘w  and,  since  they  are  valid  on  different 

characteristics  c,  , this  coupling  is  of  a more  complex  variety  than 
k 

that  normally  encountered  in  systems  of  ordinary  differential  equations. 
Initial  values  at  t = 0 are  provided  when  one  specifies 

w(-  , 0)  = WQ  e (L2  fO,  1];  En)  . (3.7) 

Consider  now  a point  (0,tQ)  as  shown  in  Fig.  3.1.  At  (0,tQ)  the 
"incoming  information"  consists  of  values  of  w associated  with 
characteristics  c^(0,  tg),  k = P+l>  • • • > P+P  = n which  reach  (0,  t^) 
after  passing  through  the  region  t < tQ,  x > 0 . On  the  other  hand,  the 
"outgoing  information"  consists  of  values  of  w associated  with 
characteristics  c^(0,t^),  k = 1,  2,  . . . , p,  leaving  (0, t^)  to  enter  the 
region  t > t , x > 0 . Referring  back  to  (3.4),-  (3.  5)  we  see  that,  along 
the  boundary  x = 0,  w is  the  incoming  information  and  w is  the 
outgoing  information.  It  is  easy  to  see  that,  along  the  boundary  x = 1, 
the  roles  of  w~  and  w+  are  just  reversed. 
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Figure  3.  1.  Boundary  configuration  at  x = 0 


If  the  boundary  conditions  for  the  original  system  (3.1)  are  written 
as 

CQ  w (0,  t)  = 0,  Cj  w(l,t)  = 0 , (3.8) 

then,  under  (3.2),  they  transform  to 

C0  0(0)  w(0,  t)  S CQ  w( 0,  t)  = 0 , 

C1  0(1)  w(l,t)  = Cj  w(l,t)  = 0 , 

and  CQ,  can  be  partitioned  in  agreement  with  (3.  5)  to  give 


Q_  w'(0,t)  + CQ+  w+(0,t)  = 0 , 

(3.9) 

u w"(l,t)  + C1+  w+(l,t)  = 0 . 

(3.10) 
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The  basic  requirement  for  existence  and  uniqueness  of  solutions  of  (3.1), 

( 3.  8)  (or  ( 3.  3),  (3.  9),  ( 3.  10))  is  that  at  each  boundary  point  the  outgoing 
information  should  be  determined  by  the  incoming  information.  This 
implies  that  (3.9)  can  be  solved  for  w (0,  t)  and  (3.10)  can  be  solved 
for  w+(l,  t); 

w (0,  t)  = Dq  w+(0,  t)  , (3.11) 

w+(l,t)  = Dj  w (1,  t)  . (3.12) 

The  type  of  control  which  we  will  consider  in  this  section  is 
boundary  value  control.  When  control  is  exercised  the  second  equation 
in  (3.  8)  takes  the  form 

C w(l,t)  + D u(t)  = 0,  u « Eq  , 

and  we  suppose  that,  after  the  transformation  indicated  above,  this 
results  in  (3.12)  being  replaced  by 

w+(l,  t)  = D w"(l,  t)  + D u(t)  (3.13) 

with  D a nonsingular  q xq  matrix.  This  means  that  the  control  vector 
u "dominates"  the  outgoing  information  at  x = 1 . 

The  basic  existence  and  uniqueness  theorem  for  solutions  is  (see 
[11],  [17],  [31]). 

Theorem  3. 1.  The  linear  operator 

Lw  = A + Bw  ( 3. 14) 

9x 

defined  on  the  domain  in  LZ([0,  1];  En)  consisting  of  functions 
we  H^fO,  l];En)  which  satisfy  boundary  conditions  of  the  form  (3.  11), 
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I 


I 


(3.12)  generates  a semigroup  S(t)  of  bounded  operators  on  L^([0,  1];  En) 

( group  if  D and  D ^ arc  both  invertible,  which  can  be  true  only  if 

p = q = n/2)  . Given  an  initial  state  (3.7),  the  system  (3.3),  (3. 11), 

^ n 

( 3. 12)  has  the  generalized  solution  w t <l  ([0,  «*);  L ([0,  1];  E ))  given  by 

w(-  , t)  = S(t)  wQ  . 

When  the  boundary  condition  ( 3. 12)  is  replaced  by  (3.13)  we  again  obtain 
a generalized  solution  we  C([0,  oc).  LZ([  0,1];  En))  of  the  general  form 

w( * , t)  = S(t)wQ  4-  C(t)  u(t)  (3.15) 

where  u^  denotes  the  restriction  of  u to  [0,t]  and  C(t):  L2 ( [0,  t ];  E^) 
2 n 

-*  L ([0,  1];  E ) is  a bounded  linear  operator  for  each  t . 

We  remark  that  smoother  solutions  can  be  obtained  if  w,  u satisfy 
certain  consistency  conditions  and  wQ  is  smooth  . See  [11],  [17],  [31] 
for  details.  It  is  known  from  general  semigroup  theory  that 

II  S(t)  ||  < M eVt,  t > 0 , 


for  some  M,  y depending  only  on  A(x),  B(x),  D , . A specific 

estimate  of  M,  y appears  in  [91] . Then  ( 3. 15)  gives 

II  w(  - , t)  ||  < M e^  ||  w || 

L <[0,l];En)  L2([0, 1];  E”) 


+ PtHuH  2 


(3.  16) 


L ([0,  t];  EM) 


We  are  now  in  a position  to  state  the  control  problem  for  the  control 
system  (3.  3),  (3.11),  (3.13): 
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Control  Problem.  Given  wQ,  w^  £ l/([0,  1];  E n)  and  T > 0,  to  de- 
termine, if  possible,  u c l/([0,  T];  Eq)  such  that  the  solution  w of_ 
(3.3),  (3.7),  (3.11),  ( 3.  13)  also  satisfies 

w(-,T)  = Wj  . (3.17) 

(i.e.  to  "steer"  solution  of  ( 3.  3),  (3.11),  (3.13)  from  w^  to  during 
f0,  T])- 

It  turns  out  that  this  problem  can  always  be  solved  if  T is  suf- 
ficiently large  in  the  case  where  S(t)  described  in  Theorem  3.  1,  is  a 
group.  In  other  cases  the  results  are  more  complex  and  Wj  must 
frequently  be  further  restricted. 

We  will  begin  with  a brief  description  of  a constructive  method 
whereby  certain  controllability  results  can  be  obtained.  This  will  be 


Then  the  above  control  problem  can  be  solved  for  the  case  w^  = 0 . If 
S(t)  is  a group,  i.  e.  , p = q = n /2  and  DQ,  are  invertible,  then 
the  control  problem  can  be  solved  for  general  Wj  £ L2([0,  1];  En)  . 

Sketch  of  Proof.  The  significance  of  the  inequality  (3.18)  lies  in  the  fact 
that  when  it  is  satisfied  the  rectangular  region  0 < x < 1,  0<t<T  can 

be  divided  into  three  separate  regions  by  the  characteristics  c ^^(1,0) 
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and  c (1,  T),  corresponding  to  the  slowest  negative  and  positive  wave 
P 

speeds.  This  decomposition  is  exhibited  in  Figure  3.2. 


Figure  3.  2.  Decomposition  of  0 < x < 1,  0<t<T 

We  note  that  there  is  an  upper  "triangular"  region,  A T,  above  c (1,  T) 

and  a lower  "triangular"  region,  A Q,  below  cp+^(l,  0)  . These  are 

separated  by  a central  "trapezoidal"  region  A . It  is  the  inequality 

P 

(3. 18)  which  ensures  that  c (1,  T)  and  Cp+1(l,  0)  do  not  meet  to  the  right 
of  the  line  x=0.  All  characteristics  c (1,  0),  p + 1 < k < n,  lie  in  AQ 
and  all  characteristics  c^(l,  T),  1 < k < p,  lie  in  A ^ . These  circum- 

stances allow  solutions  w of  (3.  3)  to  be  constructed  in  A from  the 
condition 

w(-  , T)  = 0 (3. 19) 
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For  the  former 


and  in  A from  the  condition  ( 3.  7)  i.  e.  w(*  , 0)  = wQ  . 


we  need  only  set 


w(x, t)  s 0,  (x, t)  € A , 


The  construction  in  A is  not  quite  as  quick  but  still  very  straight- 
forward. The  condition  w(-,0)  = wQ  together  with  the  boundary  con- 
dition (3.  11)  along  x = 0 can  be  used  together  with  the  integral  equation 
reformulation  of  (3.6)  to  construct  w in  the  "domain  of  determinacy" 
of  {t  = 0,  0 < x < 1},  which  is  the  region  between  that  line  segment 

and  the  characteristic  c (1,  0)  . To  proceed  further  it  is  then  necessary 
to  prescribe  (arbitrarily)  values  for  wn(0,t)  on  the  segment  1^  of 
x = 0 cut  out  by  c (1,  0)  and  c ^ ( 1 , 0)  . When  this  has  been  done  w 

is  determined  in  the  region  between  these  two  characteristics.  At  the 

, , , , n , n-1 

next  stage  arbitrary  values  are  assigned  to  both  w and  w on  1^  ^ , 

the  segment  of  x = 0 cut  off  by  c (1,  0)  and  c (1,  0)  and  so  is  then 

n-l  n-2 

determined  in  the  region  between  this  next  pair  of  characteristics.  This 

process  continues  until  w has  been  defined  in  all  of  A Q . Finally, 

arbitrary  values  are  assigned  to  wPf^,  wP+2,  . . . , w11  on  I , the 

segment  of  x = 0 cut  off  by  0 ^(1,  0)  and  c (1,  T)  (which  has  zero 

length  if  equality  holds  in  (3. 18)).  This  gives  w+  on  I and  (3.  11) 

P 

then  determines  w there,  so  that  the  whole  vector  w is  now  known 

on  I . The  extension  of  w from  I 0 A.  U A^  into  A is  essentially 

p p 0 T p 

the  Goursat  problem,  or  characteristic  initial  value  problem  [11],  [31]* 

Once  this  extension  is  completed  we  have  the  values  of  w(l,  t)  , 
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J 


0 < t < T,  on  x = 1 (and  hence  w(l,t),  w+(l,t)  there).  Then  (3.13) 

determines  u(t)  . The  basic  uniqueness  results  show  that  with  u thus 
obtained,  the  constructed  solution  w is  the  unique  solution  of  (3.  3), 
(3.7),  (3.11),  (3.13)  for  0 < x < 1,  0<t<T  and  we  clearly  have  ( 3.  19), 

completing  the  first  part  of  the  theorem.  If  S(t)  is  a group,  the  re- 
quirements for  which  we  have  elaborated  above,  the  process  can  be  re- 
versed to  obtain  a control  steering  backward  from  an  arbitrary  terminal 
state  w(.,T)  = w e L2([0, 1];  En)  to  w(* , 0)  = 0 . The  general  control 
problem  can  then  be  solved  by  adding  the  two  solutions  just  described. 

It  is  also  possible  to  do  the  whole  job  at  once  by  carrying  out  the  con- 
struction which  we  described  in  A in  both  A ^ and  A ^ when  S( t) 
is  a group.  This  completes  the  sketch  of  the  proof  of  the  theorem.  For 
further  detail,  see  [86],  [90],  [11],  [ 31].  We  make  one  further  remark, 
however.  If  one  takes  care  in  assigning  the  arbitrary  data  on  the  inter- 
vals I k = 1,  2,  . . . , n-1,  e.  g.  if  one  always  sets  such  data  equal 

K 

to  zero,  one  can  show  (see  e.  g.  [W])  that 


I , < K ||w  ||  - 

L ([0,  T];  Eq)  ° L ([0,1];  E ) 


+ VV  2 n 

1 L ([0,1];  E ) 


(3.20) 


for  some  K , K > 0 . 

0’  1 

While  the  above  argument  perhaps  best  illustrates  the  precise  role 


played  by  the  characteristics  c^  and  the  precise  degree  to  which  u is 


; 


A 
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unique  or  has  arbitrary  elements,  it  is  not  the  preferred  constructive 
method.  A variant  of  the  above  procedure  developed  by  Cirina  [9]  in 
i nonlinear  context  uses  a sequence  of  three  successive  solutions  of 
initial  - boundary  value  problems  in  standard  rectangular  regions  - 
obviously  far  more  convenient  for  numerical  purposes  than  our  extensions 
into  curvilinear  triangular  regions.  The  inequality  (3.20)  is  also  most 
easily  proved  in  the  context  of  his  procedure.  We  shall  have  more  to 
say  about  Cirina' s paper  in  Section  9. 

If 

1 , 1 . 
r dX  r dX 

< o Vx)  ‘ o Vx| 

it  is  rather  easy  to  see  that  the  control  problem  cannot  be  solved  in 
general,  even  for  — 0 . In  this  case  the  domains  of  dQtorminacy  of 
the  line  segments  t = 0,  0 <_  x 1,  and  t = T,  0 <_  x <_  1,  have  an 

intersection  with  non-empty  interior  and  the  two  determinations  of  w 
in  this  intersection,  the  one  via  w(x,  0)  = w^(x),  0 < x < 1,  and  the 

other  via  w(x,  T)  = w (x),  0 < x < 1,  are  in  general  inconsistent.  This 

is  clearly  a rather  minimal  result  and  we  expect  non-controllability  to 
hold  in  many  cases  where  T is  larger  than  this.  To  study  problems  of 
this  sort  and  other  control  questions  of  a more  complicated  nature,  one 
is  led  to  adopt  a more  indirect  approach. 

We  find  it  useful  to  adopt  the  general  abstract  scheme  set  forth 
in  Section  2.  We  take  the  spaces  X and  Z both  to  be  L2([0,l];En) 
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and  we  take  Y to  be  l/  ( [ 0 , T ] ; Eq)  for  whatever  value  of  T is  under 
consideration.  Choosing  for  the  present  to  specialize  in  the  control 
problem  with  w^  = 0 we  define  S:  X -»  Z by  solving  (3.  3),  (3.7),  (3.  11), 
(3.12),  thereby  obtaining  a solution  w(x,t),  0 < x < 1,  0<t<T,  and 

setting 

SwQ  = w(-,T)  . (3.21) 

(Thus  S = S(T),  the  latter  having  been  defined  in  (3.15).)  The  operator 
C;  Y — Z,  on  the  other  hand,  is  obtained  by  solving  (3.  3),  (3.11),  (3.13) 
with  w ( • , 0)  = 0,  thereby  obtaining  a solution  w(x,  t),  0<x<l, 

0 < t < T,  which  depends  on  u,  and  setting 

Cu  = w (• , T)  . (3-  22) 

(Thus  C = C(T)  as  in  (3.  15). ) 

From  (3.  15),  (3.  15)  we  see  that  S and  C are  both  bounded  operators 
in  this  case. 

Now  consider  the  adjoint  system 

= A(x)~  - (B(x)  - A '(x))v  (3-23) 

at  ax 

with  the  adjoint  boundary  conditions  (see  [V]  for  derivation) 

v+(0,t)  = -(A+(0))’1  D*  A"(0)  v"(0,  t)  , (3.24) 

v~(l,  t)  = -(A~(l))  1 D*  A+(l)  v+(l,t)  . (3.25) 

All  of  the  results  of  Theorem  3.  1 apply  to  (3.  23),  (3.  24),  (3.25)  pro- 
vided we  specify  terminal  data  (the  relevant  semigroup  is  defined  for 


t < 0) 
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(3.  26) 


v(-,T)  = v e l/([0,  1];  En)  . 


Continuing  with  the  general  scheme  of  Section  2,  we  define  S : Z -*  X 


s Vj  = v(-,0)  € X = L ([0,1];  En)  , (3.2 

where  v is  the  solution  of  ( 3.  2 3) -(  3.  26),  and  we  define  C;Z-Y  = 
LZ([0,T];Eq)  by 


w = C = D A (1)  v (1,  • ) € Y 


A rather  standard  computation  ([91])  shows  that  if  v satisfies  (3.2  3), 
( 3.24),  ( 3.  25)  while  w satisfies  ( 3.  3),  (3.11),  (3.  13)  then 


(v( ‘ , T),  w(*  , T)) 


2 n - (v(* , 0),  w(.,0)) 

L ([0,1];  E ) l/([0,l];En) 


= (D*A+(1)  v+(1,  • ),  u) 


L2([0,T];  Eq) 


For  u=0  we  have  (cf.(3.  21),  (3.27)) 


,vl’  SWo'  2 


L2([0,l];En)  ' *’  0lL2([O,l];En) 


and  for  w(-,0)  = w = 0 we  have  (cf.  (3.  22),  (3.  28)) 


(V1’CU)  2 n = (C  vl’U)  2 a ’ 

L ([0,  1];  E ) L ([0,T];Eq) 

£ 

confirming  that  S and  C are  the  adjoint  operator  to  S and  C , 


as  anticipated  by  the  notation. 
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We  are  now  in  a position  to  analyze  the  control  problem  for  the 


system  (3.3),  ( 3.  11),  (3.13)  with  the  aid  of  the  duality  results  embodied 

in  Theorems  2.6  and  2.7  of  the  preceding  section.  According  to  those 

theorems  we  may  study  the  control  problem,  in  this  case  that  of  steering 

from  wQ  e L^([0,l];En)  to  w^  = 0,  by  enquiring  whether  (C*,  S , Z,  Y,  X} 

is  distinguishable  or  observable,  respectively,  i.  e.  , whether  ( since 
# 4* 

D A (1)  is  nonsingular  in  (3.28)) 

v+(l,  t)  = 0,  t € [0,  T]  =»  v(x,  0)  = 0,  x « [0,  1]  (3.29) 

or,  for  some  K > 0 , 

UvV)ll  2 _ > K||  v(- , 0)||  , (3.30) 

L ([0,  T ] ; E ) L ([0,  1];  E ) 

respectively. 


We  will  first  consider  the  case  wherein 

B*(x)  = A’ (x),  x € [0, 1]  , 


(3.  31) 


which,  of  course,  includes  the  case  where  A is  constant  and  B = 0 . 

Defining  the  characteristics  c (x,  t)  as  before  we  see  that  for  (x(t),t) 

on  a characteristic  c,  we  have 

k 

d k 3v^  3 k 

— v (x(t),  t)  = (x(t),  t)  - \k(x(t))  ^ (x(t),  t)  = 0 , 

9v  9v 

since  (3.  2 3)  gives  — - A(x)  — =0,  and  A(x)  = diag( \ (x),  . . . , \ (x))  . 

This  means  that  v (x(t),t)  is  constant  on  a characteristic  c thus 

k’ 

greatly  simplifying  the  behavior  of  the  solution  v . 
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Proposition  3.  3 . for  i = p+1,  p+2,  . . . , p+q  let  T be  the  unique 
positive  number  such  that  the  solution  x(t)  of 


— -yx),  x(0)  - 1 


satisfies  x(T.  ) = 0,  and  for  j = l,  2,  , p 1 £t  x(T  ) = 1 . For 

j = 1,  Z,  . . . , p let  i be  the  largest  integer  in  the  set  { p+1,  . . . , p+q  } 

* 1j  * 

such  that  the  component  (D())  of  the  matrix  DQ  (cf.  (3.24))  is  non- 
zero if  such  an  i exists.  Define 


T.  + T, , i.  exists  , 
i.  j J 


T.  otherwise, 
1 > 


and  let 


TQ  = max  IT. 

i e lp+1,  . . . , P+q} 

j e (1,  2,  . . . , p} 


It,,  t.) 

i ) 


Assuming  (3.  31),  if  T < the  distinguishability  property  (3.  29)  (and 
hence  the  observability  property  (3.  30)  also)  cannot  hold. 

The  basic  idea  here  is  the  construction  of  paths  whereby  "information' 
can  be  sent  from  the  terminal  state  v = v(*,T)  to  the  initial  state 
Vg  = v(. , 0)  without  encountering  the  boundary  x = 1 where  the  obser- 
vation D A (1)  v ( 1,  t)  is  taken.  We  will  analyze  only  the  case  where, 


for  some  j and  i.  , we  have  T = T + T-  . Referring  to  Fig.  3.  3 
’°  *0 
we  note  that  if  T < TQ 
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Figure  3.  3.  Diagram  for  T < TQ 


the  characteristic  c.  (1,  T ) meets  the  line  t = T at  a point  (£,  T)  , 

J0  ° 


4 < 1 . We  define  a terminal  state  v(-  , T)  = by 


0 if  k * j or  if  k = j and 


Vj(x)  = 


* J a,  n ; 


1 if  k = jQ  and  x « (£,  1]  . 


Since  v is  constant  on  characteristics  c,  we  see  that  for  t > T - TV 

k . - j 

]0 

the  only  non-zero  part  of  the  solution  v is  the  component  v , which 


equals  1 in  the  strip  between  c.  (1,  T ) and  c ( 1 , T)  (and  is  equal  to 

j0  ° j0 


zero  outside  that  strip).  At  the  boundary  x = 0 we  have 


V (0,  t)  = 0,  t >T  - T k = ],2,...,n 

k -° 

v (0,t)  = 0,  t >T  - T k = l,2,...,p,  k * J 

0 

^0 

v (0,  t)  = 1,  T - T.  < t < T„  - T 
] 0 ] 

Jo  Jo 


We  now  consider  components  v , k e {p+1,  . . . , p+q  = n}  related  to 


v at  x = 0 by  the  boundary  condition  (3.24).  Since  A(0)  is  diagonal 

~ k 

the  precautions  taken  in  defining  the  T . show  that  those  v'  related 

J 

to  a characteristic  c,  (0,  s),  T - T < s < T - T , which  could  meet 

k Jo"  0 >0, 

the  segment  x = 1,  0 < t < T , are  related  to  v 0 by  a zero  coefficient 

+ - 1 * 

in  the  matrix  (A  (0))  DQ  A (0)  . This  enables  us  to  conclude 

v+(l,t)  = 0,  t c [0,  T]  . 

On  the  other  hand  we  shall  have 

i.  i. 

v V,  t)  = (A  (0)  D A (0)) 

J0 

between  the  characteristics  c,  (0,  T - T,  ) = c.  (1,  0)  and  c.  (0,  T - T.  ) 

ljo  J°  *0  ljo  J° 

so 


v0  = v(-  , 0)  A 0 


and  thus  (3.29)  does  not  hold.  The  other  possibilities  for  TQ  are 


analyzed  in  a comparable  manner. 

For  a positive  result,  we  have 

Proposition  3.4.  For  i = p+1,  p+2,  . . . , p+q  let  j be  the  largest 

* i 

integer  in  the  set  {1,2,  . . . , p}  such  that  the  component  (DQ),  of  the 

^i 
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matrix  DQ  is  non-zero,  if  it  exists.  Letting  T.  , i = p+1,  . . . , p+q  , 


T.  , j = 1,  2,  . . . , p,  be  defined  as  in  Proposition  3.  3,  let 


T.  = 

l 


T.  + T , j . exists, 

1 V 1 


T,  otherwise  , 
i 


and  let 


V 


max 


, A — < 

(T.,  T.  } 


is  {p+1,  . . . , p+q} 


i7  J 


j s {1,  2,  . . . , p} 


(3.  32) 


If  T > Tj  we  have  observability  in  the  sense  (3.  30). 

Remark.  The  time  TQ  of  Proposition  3.  3 does  not,  in  general,  agree 

with  the  time  defined  here.  This  raises  the  question,  unresolved 

at  the  moment,  concerning  the  identification  of  a "critical  time"  Tc 

such  that  observability  holds  if  T > T and  does  not  hold  if  T < T . 

c c 

Such  a critical  time  T can  readily  be  shown  to  exist  but  no  satisfactory 

c 

characterization  of  it  is  available  at  this  writing. 

Sketch  of  Proof.  Suppose  that 

S"'  v 1 = v(0,  T)  * 0 . 

Suppose,  then,  that  xe  (0,1)  and  k are  such  that 

vk(x,  T)  * 0 . (3.  33) 

If  k « {1,  2,  . . . , p}  the  characteristic  c (x,  T)  (1,  t),  0 < t < T,  and 

K. 

k — 

then  v (1,  t)  4 0,  so  that  v (1,  t)  # 0 . Since  (3.  25)  is  satisfied  at 
(1,  t),  we  will  have 
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V + (l,  T)  * 0 . 

On  the  other  hand,  if  k « { P+1,  • • • , then,  since  T>  we  will 

have  the  characteristic  c^  (x,  T)  meeting  x = 0 at  some  point  , 

0 < t < T and  v (0,  ) * 0 . Since  (3.24)  is  satisfied  at  x = 0,  some 

* k * 

component  (DJ,  of  D is  non-zero  and 
Or  0 

vf  (0,  * 0 . 

Now  (3.  32)  shows  that  T > + T"  , and  since  T > , cf(0,  meets 

l 

x = 1 at  a point  (1,  t^)  with  0 < < T and,  since  v is  constant  on 

( 

c (0,  t ),  we  have  v (1,  r ) ± 0 . Then,  as  before 
Q 1 •— 

vf(l,  t2)  # o . 

In  either  case,  by  letting  x vary  in  some  non-null  subset  of  (0,  1) 

where  (3.  33)  holds,  we  see  that  v (1,  • ) is  not  the  null  element  of 

L2([0,  T];  Eq)  and  distinguishability,  (3.29),  has  been  proved.  A 

slightly  more  careful  analysis,  keeping  track  of  the  norms  involved, 

establishes  observability,  (3.  30). 

The  duality  theory  of  Section  2 then  gives 

Corollary  3.  5.  For  the  hyperbolic  control  system  (3.  3),  (3.11),  (3.12) 

there  are  positive  numbers  T and  such  that  the  problem  of  control 

2 n 

from  an  arbitrary  e L ([0,  1];  E ) to  = 0 during  [0,  T]  is  not 

solvable  if  T < TQ  and  is  solvable  if  T > T^,  provided  ( 3.  31)  is  valid. 

* 

Our  next  task  is  to  see  if  the  restriction  B (x)  = A'(x)  can  be  re- 
moved. This  is  done  in  two  steps.  One  first  considers  the  case  wherein 
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B(x)  - A‘(x)  is  diagonal  (i.  e.  , B(x)  is  diagonal,  since  A(x)  is  already 
in  that  form).  Here  the  proofs  of  Proposition  3.  3 and  3.  4 are  virtually 
unchanged.  Although  the  v are  not  constant  on  characteristics  c^  , 
they  nevertheless  satisfy  uncoupled  scalar  linear  homogeneous  equations 


dv  ' 

"dT 


(x(t),t)  = (BT(x)  - A'(x))£  vk(x(t),t)  , 


k = 


and  hence  there  are  positive  0Q,  (3j  such  that 


0 < 
H0  - 


v (x(t),  t) 

k a 

v (x(t),t) 


±*1 


(3.  34) 


A A 

whenever  (x(t),t),  (x(t),  t)  lie  on  the  same  characteristic  c . The 

relationship  (3.  34)  is  clearly  just  as  useful  in  the  proofs  of  Propositions 
3.  3 and  3.4  as  the  constancy  of  v on  c^  which  we  assumed  there. 

The  next  steps  are  somewhat  more  subtle  and  are  in  the  nature  of 
a perturbation  argument.  We  have  to  divide  our  result  into  two  parts. 
Theorem  3.  6.  For  the  control  system  (3.  3),  (3.11),  (3.  1 3)  and  associated 
dual  linear  observed  system  (3.  23),  (3.  24),  (3.  25),  (3.  28)  we  do  not 
have  exact  controllability  to  = 0 and  observability,  respectively,  if 
T < T . (However,  this  conclusion  is  not  shown  to  extend  to  lack  of 

Q ! 

a pi  oximate  controllability  and  distinguishability,  respectively.  ) 

Sketch  of  Proof.  Let  the  operators  S and  C be  defined  as  in  (3.27), 
(3.28)  for  the  general  control  system  and  let  the  notations  Sd  and 
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be  employed  for  the  operators  related  to  the  corresponding  "diagonal 
system  wherein  (3.2  3)  is  replaced  by 


dv 


3v 


aT  = A<x>  ^ + <B  <x>  - A'<x»d  v > 


where  (B  (x)  - A’(x))^  denotes  the  diagonal  part  of  that  matrix.  A 
somewhat  involved,  but  not  conceptually  difficult,  argument  allows  one 

ijc 

to  see  that  the  operator  differences  S - S ,,  C - C , are  both  compact. 

d d 

When  T < T^,  as  in  Proposition  3.  3,  a slightly  more  careful 
analysis  shows  that  there  is  an  infinite  dimensional  subspace,  call  it 
ZQ,  of  L2([0,1];  En)  such  that 


S , v ||  > K.  II  v ||  , v,  e Zn  , 

^ L2([0,  1];  En)  L2([0,  l];En)  0 


(3.  35) 


for  some  > 0,  while 


Cd  V1 


°-  V1  £ Z0 


(3.  36) 


With  ( 3.  35),  (3.  36)  true,  it  is  a standard 


exercise  in  functional  analysis  to  show  that  we  cannot  have  (3.  30)  if 

sjc  £ >!: 


S - S , and  C - C , are  both  compact.  The  conclusion  that  we  do 
d d 

not  have  observability  of  (3.  23)  - (3.28),  and  hence  (exact)  controllability 
of  (3.  3),  (3.  11),  (3.  13)  for  T < follows.  Note  however  that  one  can- 
not show  that  (3.  29)  is  false. 

The  general  controllability  result  for  T > T^  is  considerably  more 
complicated.  Before  stating  the  theorem  we  present  an  example  to  show 
that  this  should,  indeed,  be  the  case.  Consider  the  two  dimensional 
system 
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(3.  37) 


L 


¥T<V1'  = «o  -°t'll<V2)"  0<X<1,  t>0 

V V 


with  boundary  conditions 


v\o,t)  = 0,  vZ(l,t)  = 0 


[3.  38) 


and  observation 


C*  v.  = C*  v(-,T)  = vV,  •)  « L2[0,  1]  . 


(3.  39) 


Here  one  readily  verifies  that  we  have  observability  for  T > Tj  = 1 . Indeed, 
no  matter  what  v^  = v(-  , 1)  may  be,  we  have 


S Vj  = v(-,0)  = 0 


Thus  the  observability  condition  (3.  30)  is  satisfied,  as  it  were,  by 
"default"  on  the  part  of  S rather  than  by  any  "strength"  of  the  ob- 
servation operator  (3.  39).  It  is  perhaps  not  surprising,  then,  that  a 
small  perturbation  in  (3.  37)  yields  a system  which  is  not  observable. 


Indeed,  for  e > 0,  let  (3.  37)  be  perturbed  to 


9 v1  1 0,9  v1  ,0  e , ,vJ 


9t  ( 2*  (0  -I!3x  1 2)i(0  0)(  21  ’ 


(3.40) 


v 


with  the  boundary  conditions  (3.  38)  and  the  observation  operator  (3.  39) 
being  retained.  One  may  then  verify  quite  readily  that  if  we  give  the 


terminal  state 


v\x,  1)  = (1  -x),  v2(x,l)  = l,  0 < x < 1 


1 2 

the  effect  of  the  coupling  of  v to  v in  (3.40)  is  to  give 


v (1,  t)  s 0,  0 < t < 1 


-42- 


that  C Vj  = 0 . But 

1 1 X 
v (x,  0)  - v (0,  x)  - f £ dt  = - ~ 

X 

2 

1 ^ 

so  v (x,  0),  and  hence  S = v(- , 0)  # 0 . It  follows  that  (3.40), 

( 3.  38)  , (3.  39)  is  not  even  distinguishable  on  [0,  1],  for  any  e > 0 . 

We  first  attempt  to  rule  out  this  sort  of  behavior  by  strengthening 

* 

S . This  we  do  by  assuming  the  system  (3.  3),  (3.11),  (3.  12)  to  be  time 
reversible,  the  requirements  for  which  we  have  set  down  earlier.  Then 
the  operator  (3.  14)  generates  a group  S(t)  for  t « ( -oo,  oo)  and  S,  S* 
are  invertible.  In  fact,  with  S , as  defined  earlier,  we  shall  have,  for 
Vj  € LZ([0,  1];  En), 


IS,  v II  n > M , ||  v II 

L ([0,  1];E  ) d L ([0,  1];  En) 


(3.41) 


Theorem  3.7.  Let  (3.  3),  (3.  11),  (3.  12)  be  time  reversible  (see  note  in 
statement  of  Theorem  3.  1)  and  let  T > T^  as  described  in  Proposition  3.4. 
Then  ’here  i:--  an  e > 0,  depending  only  on  T,  A(x),  the  diagonal 
elements  of  B(x),  and  the  matrices  C , D (cf.  (3.11),  (3.12),  (3.13)) 
such  that  if  each  off-diagonal  element  bj(x),  i*j,  satisfies  |b|(x)|<e 
x«  [0,1],  then  the  system  ( 3.23),  ( 3.24),  (3.25),  (3.28)  remains  ob- 
servable on  [0,  T]  (equivalently  (3.  3),  (3.11),  (3.13)  remains  controllable 
on  [0,T]). 


Sketch  of  Proof.  Since  we  have  observability  for  the  diagonal  system 
we  have 
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> 

l/([0,ll;Eq)  “ 


l/([0,l];En) 


> KM  ||  v || 

L ([0,1];  En),  VjC  L ([0,l];En)  . 


(3.4Z) 


Let  the  terminal  state  v be  given  and  let  v(x,  t)  be  the  resulting 
solution  of  the  diagonal  system  for  t < T,  v(x,  t)  the  corresponding 
solution  of  the  complete  system  (3.  23),  (3.  24),  (3.  25)  (with  the  off 
diagonal  elements  of  B(x)  restricted  as  above).  Then  v = v - v 
satisfies  the  boundary  conditions  (3.24),  (3.25)  again,  the  terminal 
condition 

v(x,  T)  = 0,  x € [0,  1]  (3.43) 


and  the  system  of  differential  equations  (cf.  (3.23)) 


9v 


8v 


8t  - A(X)9^  " (Bd(X)  " A'<x»v  + <B  M - Bd(x»(v  + v)  , (3.44) 


where  B^(x)  denotes  the  diagonal  part  of  B(x)  . Constructing  and 
estimating  the  solution  of  (3.  4 3),  (3.42),  (3.  24),  (3.25),  in  the  process 
making  use  of  the  fact  that  v can  already  be  bounded  in  terms  of 
Vj  = v(x,  T),  the  condition  on  the  elements  bj(x),  i*j,  of  B (x)  - B^(x) 
shows  that 


* * 

<s  -Sd'h1 


LZ([0,1]:  E"> 


I v (■ , 0) 


LZ([0,l];En) 


£ K(e)l|v  II 

L ([0,  1]  ;E  ) , 


(3.45) 
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||(C*  -C*)v.ll  n = (cf.  (3.  28))  ||  D A+(l)v  +(1,  • ) |1 

d ‘ L ([0,  1];  Eq)  L ([0,l];Eq) 

< L (e)  II V ||  7 (3.46) 

1 L ([0,l];En) 

for  positive  K(e),  L(e)  tending  to  zero  as  e — 0 . rrom  (3.41),  (3.42),  (3.45), 
(3.46)  one  easily  obtains  (3.  30)  for  some  M > 0,  provided  e is  sufficiently 
small,  and  the  proof  is  complete. 

We  remind  the  reader  that,  while  Theorem  3.  7 is  stated  in  terms  of 
the  operator  S defined  in  (3.21),  and  hence  directly  yields  only  the  con- 
trollability to  w(-,T)  = 0 for  (3.  3),  (3.11),  (3.13),  the  assumption  that 
this  system  is  time  reversible  guarantees  that  ^ (S)  = (I)  so  that  con- 

trollability to  any  given  terminal  state  follows  immediately. 

The  condition  that  the  off  diagonal  elements  be  small  can  be  re- 
laxed if  we  assume  approximate  controllability  to  hold  for  the  perturbed 
system,  i.  e. 


CV|r0^  S Vj-0  . (3.47) 

Since  S,  and  hence  S ' , is  boundedly  invertible  when  ( 3.  3),  (3.11), 
(3.12)  is  time  reversible,  (3.47)  is  the  same  as 

C"v1  = 0 =>  = 0 . 


* * 

Then  (3.42),  together  with  the  fact  that  C - is  compact,  as  noted 
earlier,  allows  the  Fredholm  alternative  theorem  [84]  to  be  applied  to 
give 


'CV2  n — ^1 

L ([0,  l];En) 


LZ([0,l];En] 


±K2 


I * 

Is  V, 


LZ([0,1] 


E") 
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for  positive  K , K . That  is,  with  a compact  perturbation,  as  we  have 

1 C* 

here,  distinguishability  implies  observability.  More  detail  is  given  in 
the  proof  of  a comparable  theorem  in  [16].  However,  this  result  is  not 
too  useful  because  distinguishability  is  rather  easily  destroyed.  We 
have  seen  this  to  be  the  case  in  the  example  given  prior  to  Theorem  3.7. 
There,  however,  the  system  is  not  time  reversible.  To  see  that  it  can 
still  happen  for  a time  reversible  system  we  consider  a second  example. 
We  form  the  four  dimensional  system 


d_ 

at 


1 -\ 

/ 

\ 

1 - 1 

/ 

h 

0 

0 

°) 

M 

1 0 

a 

0 

P 

vi 

V2 

0 

1 

* 2 

0 

0 

a 

V2 

1 

0 

0 

0 

0 

V 

2 

+ 

vi 

0 

0 

1 

0 

dx 

+ 

V1 

T 

0 

-P 

0 

- a 

vi 

1 d 

1 0 

0 

0 

1 

2 

0 

0 

0 

0 

with  a,  (3  yet  to  be  determined,  and  boundary  conditions 

V0’ t)  = v2(0’ V2(°’ t)  = V1  (0’  ^ ’ 

vl"(i,  t)  = v*(l,  t),  v-(l,  t)  = v*  (1,  t)  . 

The  observation  is 


MvA 


(3.49) 
(3.  50) 


C'v(-,T)=|  A J . (3.51) 

Clearly  we  have  p = q = 2 and  ( 3.  49),  (3.  50)  shows  that  (cf.  ( 3.  24), 


(3.  25))  we  must  have 
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characteristics  c^(l,  0),  c^(|,  0),  c~(l,  3),  c^(},  3),  the  segment  of  the 
t axis  between  t = 1 and  t = 2 and  the  segments  of  t = 0,  t = 3 between 
x = A and  x = ] . Further  we  let 

v^(x,  t)  £ 0,  (x,  t)  Lies  above  c^(l,  0)  in  Fig.  3.  4 , (3.  54) 

v~(x,  t)  s 0,  (x,  t)  lies  above  c~(l,  3)  in  Fig.  3.4.  (3.55) 


v2(l,t)  s v“(l,t)  = 0,  0<  t < 3 


(3.  56) 


and  leaves  v^  undefined  only  in  region  F,  v undefined  only  in  region 
G . 

Consider  the  characteristic  c^(0,  1+t),  where  0 < t < 1,  as 
shown  in  the  figure.  This  line  intersects  0^(1,  3)  at  the  point  (t,  1+2t) 
and  intersects  c2(l,  0)  at  the  point  ^ r,  y + —■  ) . Using  the 
boundary  conditions 

v~(0,  1+t)  = v2(0,  1 + t)  (=1  from  (3.  53),  if  0 < t < 1) 

and  integrating  the  first  equation  implied  by  (3.48)  along  the  characteristic 
Cj  (0,  1+t)  we  find  using  (3.52)  that 


Vj  (1,  2+  t)  = 1 + «t  + (3(j  - — T) 

= 1 - t - 3(  j - y r)  s 0,  0 < t < 1 


(3.  57) 


On  similar  analysis  using  the  characteristic  c^  (0,  1+  t)  gives 


/jtl,  T)  s 0,  0 < T < 1 . 


(3.  58) 
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For  t = 0,  0 < x < 4 we  put 


v^(x,  0)  = 1 , 

which  gives  vGx,  t)  s 1 in  region  F . For  t = 3,  0 < x < 4 we  put 


v"(x,  3)  - 1 


which  gives  v (x,t)  = 1 in  region  G . Integrating  over  characteristics 


c (0,  t),  c (0,  2 4-t)  one  find  that 


Defining 


v (1,  1 + t)  = 0,  0 < t < 1 


v ( 1 , 2+t)  s 0,  0 < t < 1 . 


Vj’fx,  0)  = 1,  0 < x < 4 , 

v^fx,  3)  = 1,  0 < x < 4 , 

Vj'fx,  0)  = Z(l-x),  4 < x < 1 , 

Vj  (x,  3)  = 2(l-x),  4 < x < 1 , 


and  integrating  over  characteristics  c^  (x,  0),  c ' (x,  3)  one  has 

v"(l,  x)  S 0,  0 < x < 1 , 

v+(l,  3-x)  = 0,  0 < x <1  . 


(3.  59) 


( 3.  60) 


(3.61) 

(3.62) 


Finally  we  note  that  (3.  49)  , (3.50),  (3.  54),  ( 3.  55)  show  that  v*(x,  t)  = 0 
below  Cj  (0,  1)  and  v^  (x,  t)  = 0 below  Cj  (0,  2)  . All  quantities  have  now 
been  defined  consistent  with  (3.48)  and  (3.49),  (3.  50).  The  solution  is 

clearly  not  identically  equal  to  zero  at  t = T = 3 but  (3.  54)-(3.  62)  show 

* 2 ? 
that  C v( • , 3)  (cf.  (3.  51))  is  the  zero  element  of  L ([0,  3];  E ) . We 
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conclude  that  ( 3.  48),  (3.49),  (3.50),  ( 3.  51)  is  not  observable  in  [0,  3] 
when  a and  0 are  given  by  (3.  52). 


To  summarize;  as  seen  in  the  example  before  Theorem  3.7,  when 
the  system  (3.  3) , ( 3.  11),  (3.  i2)  is  not  time  reversible  it  is  possible  that 
observability,  as  defined  in  terms  of  ( 3.  24),  (3.  25),  (3.26),  (3.28),  can 
be  destroyed  by  arbitrarily  small  perturbations  in  the  off-diagonal  elements 

sjc 

of  the  matrix  B (x)  . Theorem  3.  7 shows  this  cannot  happen  if  the  system 
is  time  reversible,  the  perturbations  then  must  be  suitably  large  to  destroy 
observability.  The  example  immediately  preceding  this  paragraph  shows 
that  observability  and  even  distinguishability  can  be  destroyed  by 
"suitably  large"  perturbations  for  T = Tj  . 

Now  we  come  to  a rather  curious  punch  line.  The  fragility  of  ob- 
servability of  ( 3.  24),  (3.25),  (3.26),  (3.  28)  (and  hence  of  controllability 
of  ( 3.  3),  ( 3.  11),  ( 3.  1 3))  exhibited  in  these  examples  for  T = T^  (=1  in  the 
first  example,  =3  in  the  second  example)  does  not  occur  if  we  use  the 
control  time  (3.18)  of  Theorem  3.2.  The  constructive  proof  of  controlla- 
bility described  there  is  in  no  way  affected  by  any  changes  in  the  matrix 
B(x)  - it  works  for  every  continuous  B(x)  . Thus  in  the  example  before 
Theorem  3.7  we  cannot  destroy  observability  via  B(x)  if  we  take  T = 2 , 
and  in  the  example  following  that  theorem  we  cannot  destroy  observability 
via  B(x)  if  we  take  T = 4 . In  other  words,  the  observability  and 
controllability  associated  with  the  "overly  generous"  time  interval  defined 
in  Theorem  3.  2 are  more  "robust"  than  those  associated  with  the  generally 
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more  precise  time  interval  defined  in  Proposition  3.  4 and  Corollary  i.  3. 
This  casts  some  doubt  on  the  usefulness  of  these  latter  results  in  practice 
and  makes  the  search  for  the  critical  time  T , described  after  the 
statement  of  Proposition  3.  4,  of  rather  academic  (if  any)  interest. 

In  Section  2 we  have  noted  the  close  connection  between  control- 
lability and  stabilizability  which  obtains  in  the  case  of  finite  dimensional 
linear  control  systems.  We  now  proceed  to  show  that,  at  least  in  part, 
this  relationship  can  be  extended  to  the  infinite  dimensional  control 
systems  which  we  have  been  discussing  here.  For  the  general  system 
(3.3),  (3.11),  (3.  12)  our  results  are  quite  incomplete.  A special  case, 
to  be  discussed  in  Section  4,  can  be  treated  much  more  satisfactorily. 

The  class  of  systems  which  wre  shall  treat  is  that  described  by 
(3.3),  (3.11),  ( 3.  12)  with  the  additional 
Assumption  3.  8 . For  each  w e E and  x € [0,  1] 


(w,  ( B (x)  + B(x)  - A'(x))w)  <0  . 

E 

p 4-  q 

Further,  for  each  vector  w e E , w € E , and  some  y > 0, 

(w‘,  (A"(l)  + Eu  A+(1)D  )w")  <-yl|w~||2 

1 EP  EP 


(3.  63) 


<w  + , (DV(0)  Dq  + A+(0))w+)  < 0 . 


(3.  64) 
(3.65) 


For  continuously  differentiable  solutions  of  (3.  3),  (3.11),  (3.12) 
(which  correspond  to  initial  states  w(-  , 0)  = w^  in  the  domain  of  the 
generator,  L,  of  the  semigroup  S(t)  described  in  Theorem  3.  1)  we  can 
study  the  evolution  of  the  norm  of  the  solution  w(-  , t)  by  computing 
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1 


~ II  w(. , t)  )|z  = ~ f ||w(x,  t)HZ  dx 

dt  L2([0,l];En)  dt  '0  En 


1 , 


dw 

^(alT(x> l)>  A(x)w(x>  t>)  n 


+ (w(x,  t),  A(x)  ~-(x,  t))  n f (w(x,  t),  (B  (x)  f B(x))  w(x,  t)  Jdx 

E E 


(integrating  the  first  term  by  parts)  = 


= / (w(x,  t),  (B  ‘ (x)  + B(x)  - A‘(x))  w(x,  t))  dx 
0 E 


+ (w(l,  t),  A(  1)  w(l,  t))  - (w(0,  t),  A(0)  w(0,  t))  n 

E E 


Using  the  first  part  of  Assumption  3.  8 and  the  boundary  conditions  (3.  11) 
and  (3.  12)  we  now  have 

llw(-,t)||2  n <(w"(l,t),  (A"(l)  + D*  A + (l)  D )w"(1,  t) ) 

L ([0,  l];En)  EP 


-(w+(0,  t),  (D*  A"(0)  Dq  + A+(0))  wf(0,  t))  q < (using  (3.  64),  (3.  65)) 


- yllw"(l,  t)||  . 

E 

The  integrated  form  of  this  inequality,  i.  e. , 

T 

l|w(-,0)||2?  n - ||w(-,T)||2  > v f ||w'(l,t)||2  dt 

L ([0,  l];En)  L ([0, 1 ] ; E ) *0  EP 


Y II  w ~ ( 1,  • )|| 

L ([0,  T];EP) 


(3.66) 
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remains  true  for  general  zed  solutions  of  (3.  3),  (3.11),  (3.12)  corresponding 
to  arbitrary  initial  states  w(-  , 0)  « L~([0,  1];  En)  . 


! 


i 


In  physical  applications  the  inequality  (3.63)  of  Assumption  3.  8 
states  that  there  is  no  energy  source  in  the  interior  of  the  medium  under 
consideration.  Likewise  (3.6  5)  states  that  there  is  no  energy  source  at 
the  boundary  x = 0 . The  inequality  (3.64)  is  stronger,  implying  that 
there  is  an  actual  loss  of  energy  at  x = 1 which  is  greater  than  or  equal 
to  a certain  positive  fraction  of  the  norm  of  the  "incoming  information" 
at  x = 1 at  the  given  instant  t . We  remark  that  (3.64)  and  (3.65)  are 
true,  respectively,  if  Dj  and  D^,  respectively,  are  sufficiently  small. 
In  some  cases  this  might  not  be  true  a priori.  One  might  then  augment 
(3.  12)  to  (3.  13)  by  use  of  a control  variable; 

w+(l,  t)  - Dj  w'd,  t)  + Du(t)  . 

A boundary  "feedback"  relation 

u(t)  = K w"(l,  t)  , (3.  67) 

expressing  the  control  u(t)  in  terms  of  the  "incoming  information"  at 
x = 1 at  the  instant  t could  then  be  employed  to  give 

w (1,  t)  = Dj  w (1,  t)  = (Dj  + DK)  w (1,  t)  . 

A 

Since  D is  assumed  nonsingular,  K could  be  selected  so  that  Dj  is 
small  and,  consequently,  (3.64)  holds. 

One  suspects  from  (3.66)  that 
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A 


= 0 . 


(3.68) 


lim  II  w(-  , t)D 

t-«  lT([0,l];En) 

This  is  in  fact  true  and  even  more  can  be  established;  namely,  that  there 

is  a uniform  exponential  decay  rate  for  the  norm  of  the  solution.  The 

quantity  II  w( - , t)||  is  being  used  here  as  a Liaponnov  function 

L (fO,  l];En) 

and  the  inequality  (3.66)  guarantees  that  it  is  non-increasing  and,  in 
fact,  decreasing  whenever  w~(l,  t)  + 0 . The  situation  is  quite  analogous 
that  encountered  in  the  study  of  the  damped  harmonic  oscillator 

x + x + x = 0 . (3.  69) 

The  energy,  ^[(x)2  + (x)2],  here  obeys 

^(i[(x)2  + (X)2])  = -(X)2 

and  decreases  when  x + 0 . Equation  ( 3.  69)  is  usually  analyzed  by 
recourse  to  the  so-called  "invariant  set  principle"  ([52])  which  states 
that  the  solution  tends  to  the  largest  subset  of  {(x,  x)|x  =0}  which 
is  invariant  under  (3.69).  Then  one  shows  that  x cannot  vanish  on 
any  interval  so  this  invariant  set  must  be  the  origin. 

In  the  case  of  (3.  66)  we  have  a rather  novel  situation  in  that 
w (1,  t)  may  vanish  on  an  interval  of  positive  length.  In  fact,  if  the 
support  of  w(-  , 0)  is  restricted  to  [0,  e]  then  we  shall  have  w(l,  t) 

(and  hence  w"(l,t))  h 0 for  t e [0,  t), 

1 . 

t = . r — — — — . 
t N(x) 

It  would  nevertheless,  be  possible  to  use  the  extended  invariant  set 
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principle  ([12],  [ 34]  and  applications  in  [94],  [ 10 S]  to  establish  (3.  68). 

We  will  indicate  how  this  is  done  in  a slightly  different  context  in 
Section  6.  However  this  approach  by  itself  would  only  yield  ( 3.68)  and 
not  an  estimate  of  the  rate  of  decay.  Consequently  we  elect  to  go  via 
a slightly  different  route  (but  which,  in  the  end,  amounts  to  more  or  less 
the  same  method). 

What  we  will  obtain  is  an  estimate 

T - 2 2 

f II  w ~ ( 1 , t ) |l  dt  > (3  II  w(-  , T) ||  , 

0 EP  ~ L ([0,1];  En) 

which,  in  the  light  of  the  earlier  work  of  this  section,  one  immediately 
recognizes  as  an  observability  type  result,  w (1,  • ) being  the  observation. 
In  [82],  where  exponential  decay  of  solutions  of  ( 3.  3),  (3.11),  (3. 12)  under 
the  hypotheses  of  Assumption  3.  8 was  first  (to  our  knowledge)  obtained, 
the  authors,  Rauch  and  Taylor,  obtain  this  "observability"  result  directly. 
Because  we  already  have  Theorem  3.  2,  we  elect  to  proceed  by  a "dual" 
method  making  use  of  controllability. 

Consider  the  dual  system  ( 3.  2 3),  ( 3.24),  ( 3.25),  but  augmented 
with  a control  vector  f e EP  so  that  (3.25)  is  replaced  by 

v'(l,  t)  = -(A'(l))"1  Dj'  A+(l)  v+(l,t)  + f(t)  . (3.70) 

Now,  as  noted  earlier,  solutions  of  ( 3.  2 3),  (3.24),  (3.7  0)  are  defined 
corresponding  to  specified  terminal  data.  The  control  problem  which 
we  pose,  then,  is  to  steer  the  terminal  state 
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(3.71) 


to  the  initial  state 


v(-,T)  = v £ L2(fO,l];En) 


v(-  , 0)  = 0 


by  use  of  the  control  f £ ( [0 , T] ; EH)  . Taking 


T > r‘  --d*  - . / -*L 
~ 0 Vl(X)  0  I * * * * * * * *  Xp(x) 


as  before,  Theorem  3.  2 applies  (with  the  time  sense  reversed)  to  show 
that  this  control  problem  has  a solution.  Moreover,  the  estimate  (3.20) 
applies;  translated  into  our  setting  it  reads 


I <K  llv(.,T)ll 

L ([0,T1;EP)  L ([0,  1];E  ) 


(3.  73) 


The  idea  now  is  to  make  a particularly  advantageous  choice  of 
v(-,T).  Let  w be  the  solution  of  ( 3.  3),  (3.11),  ( 3.  12)  under  considera- 

tion, for  which  we  have  developed  the  inequality  (3.  66),  and  with  T 

satisfying  ( 3.  18),  let 


v(-,T)  = Vj  = w(-,T) 


(3.74) 


Letting  f be  the  control  described  above,  but  for  this  particular  terminal 

state,  we  use  a computation  quite  similar  to  that  used  to  obtain  (3.66) 

(but  now  involving  — (w(-  , t),  v(-,t))  ) to  obtain 

l/([0,l];En) 


|w(-,T)ir  = (from  ( 3.  7 1))  (w(- , T),  v(- , T))  _ 

L([0,l];En)  L ([0,  1];E  ) 


-(w(-,0),  v(-,0))  =f  (w"(l,t)  A‘(l),  f(t))  dt 

L([0,l];En)  0 EP 


< (Schwarz  inequality)  a||w  ( 1 , - ) II  . n*n  2 n > 

L ([0,  T];E  ) L ([0,  T];EM) 


with  a > 0 depending  only  on  T,  A (1)  . Then  from  (3.7  3)  we  have 


II  w(  - ,T)  II 


Iw'(l,  • 


> 

L ([0,T];EP) 


L ( T o , 11;  E 


Then  ( 3.  66)  gives 


II  w(-  , 0)|| 


LZ([0,  l];En) 


- II  w(-  , T)  ||  2 


2 n - l|w(,’T)l1  2 
L ([0,  1];E  ) a KQ  L ([0,  1];E  ) 


so  that 


if w(*  ,T)|| 


1 


L2([0,  l];En)  l±y/aZKZ0 


II  w(-  , 0)  || 


L2([0,l];En) 


Since  the  coefficient  matrices  in  (3.  3),  (3.  11),  (3.  12)  do  not  depend 
on  t,  the  above  argument  may  be  repeated  on  successive  intervals 
[kT,  (k+l)T],  k=  1,2,3,...  to  give 


II w(- , (k+l)T)  II 


< II  w(-  , kT)  ||  2 

L2([0,  l];En)  14-y/a  L ([0,  1];E  ) 
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whence 


||  w(-  , kT)  ||  Z 1 ( 4r)k  N w(*  > °)  II  ^2  n • 

L ([0,I];E  ) 1+y/a  KQ)  L ([0,  1];E  ) 

Since  (3.66)  shows  the  norm  of  w(-,t)  to  be  non-increasing  with  t , 

-1  ,2  2. 

we  have,  for  some  M > 0 and  p = T log(l  + y a K^)  , 

Theorem  3.  9.  Let  w = w(x,  t)  bo  i generalized  solution  of  (3.  3),  (3.11), 
(3.  12)  and  let  Assumption  3.  8 be  satisfied.  Then,  for  t > 0 , 


|w(-  , t) 


L ( [ 0 , 1 ] ; E 


< Me  fat  ||  w(-  0)  II 
n — r 2 


L ([0, 1];E  ) 


i.  e.  the  semigroup  S ( t)  described  in  Theorem  3.  1 has  the  property 


S(t)||  < M ? e ~ 2 Kt, 


t > 0 . 


This  theorem  has  some  features  in  common  with  Theorem  2.  9.  In 
fact  one  can  give  a proof  that  the  feedback  relation  (2.  21)  stabilizes 
the  finite  dimensional  control  system  (2.  1)  which  exactly  parallels  the 
proof  given  above  for  Theorem  3.  9.  But  we  are  far  from  a "controllability 
implies  stabilizability"  type  result.  In  [103],  [105]  Slemrod  has  given  an 
extension  of  the  Lukes -Kleinman  stabilization  method  to  a wide  class  of 
(exactly)  controllable  systems  in  Hilbert  space.  That  method  can  be 
extended  to  cover  the  present  case  also  but  its  development  in  the  context 
of  the  present  boundary  control  system  would  be  much  too  lengthy  and 
technical  for  presentation  here.  In  effect,  Slemrod's  Fesult  would  pro- 
vide a formula  comparable  to  (2.  21).  In  the  present  situation,  assuming 
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controllability  of  (3.  3),  (3.11),  ( 3.  1 3)  on  some  interval  (0,  T],  but  not 
requiring  Assumption  3.  8,  that  method  would  provide  a feedback  lav; 

(cf.  ( 3.  67))  of  the  form 

1 

u(t)  Kw'(l,t)  t J L(x)  w(x,  t)dx 
‘ 0 

for  which  the  closed  loop  system  consisting  of  (3.  3),  (3.11)  and  (cf. 
(3.12)) 

1 

w+(l,  t)  = (D  +•  DK)  w"(l,  t)  f D f L(x)  w(x,  t)dx 
1 0 

would  have  uniform  exponential  decay  for  ||w(-  ,t)||  n as 

LZ([0,1];E  ) 

t — &o  . in  [90],  by  a different  procedure,  precisely  this  result  has  been 
obtained  for  a two  dimensional  system  of  the  form  (4.  1).  However,  a 
much  more  complete  theory,  comparable  to  the  last  statement  of  Theorem 
2.  9 has  recently  become  available  for  these  simple  two  dimensional 
systems  and  will  form  the  basis  for  our  discussion  in  Section  4. 


r ^ 


4-  CONTROLLABILITY  VIA  HARMONIC  ANALYSIS,  CONTROL  CANONICAL 
FORM,  SPECTRAL  DETERMINATION 

Here  we  consider  a very  special  subclass  of  the  hyperbolic  systems 
treated  in  Section  3 and,  at  least  at  the  outset,  we  postulate  a different 
form  of  control  action  from  that  of  the  boundary  control  (3.  13).  The 
differential  equation  is 


9 , 1 , .0  1 9 1 1 

Bt  W2  ~ 1 o ) ax  (W2  ) + B(X)  (w2)  + g(X)  U(t)  • 

W W W 


(4.  1) 


.0  1 


Here  A = ^ Q)  is  symmetric  with  eigenvalues  ±1,  B(x)  is  assumed  a 
continuous  2X2  matrix  on  [0,  1]  and 


g(x)  = 


9 (x) 

gZ(x), 


is  a vector  in  L ([0,  1];  E“)  . The  control  u is  scalar.  The  boundary 
conditions  under  consideration  take  the  form 


jw  (0,  t)  + bQw  (0,  t)  = 0,  ajW  (1,  t)  + b^w  (1,  t)  = 0 . (4.  2) 


The  requirement  for  existence  and  uniqueness  of  solutions  both  for  t 
positive  and  for  t negative,  from  the  discussions  before  and  after 
Theorem  3.  1 of  Section  3,  here  take  the  form 

(al  ' bl)(a0  + V * ° ’ 

(ai + bi)(ao  - V * 0 • 

so  that 
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^s,Vbi)(Vbo)/lVbi)<VV 


(4.  3) 


is  a well  defined  quantity. 

An  example  arises  very  readily  from  the  controlled  wave  equation 


3 w a w 

at2  ax2 


= g (*)  u(t) 


with  boundary  conditions 


0 w 0 W 0 W 0 W 

an  ~a7  'M)  + bn  = °<  + b,  *)  = 0 • < 4 ■ 5) 


, 1 3w  2 9w 

With  w = — , w = — , (4.4)  gives 


9w^  3w2 

at  dx 


+ g (*)  u(t) 


and  the  familiar  commutativity  relation 


2 2 2 
a w a w a w 

3t3x  3x9t  8x3t 


(4.6) 


becomes 


3w2  3w^ 

3t  9x 


(4.7) 


Equations  (4.6),  (4.7)  combine  to  give  (4.1)  with 


B(x)  = 0,  g(x)  = (9^X))  . 


Under  the  assumptions  set  forth  above  the  operator 


°)^(5)  +B,x,($ 


(4.8) 


2 2 

defined  on  the  domain  in  L ([0,1];  E ) consisting  of  two  dimensional 

vector  functions  fwJx^  ] with  derivatives  in  L2 ( [0 , 1 ] ; E2)  and  satisfying 
'w2(x)' 

boundary  conditions  of  the  type  (4.  2)  has  a rather  simple  spectral 
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decomposition.  The  eigenvalues  take  the  form  (cf.  (4.  3)) 


a,  = 7 log  y + kin  + e = a + k ni  + e,  , 

K K K 

_ 00  < k < oc  , 


(4.  9) 


where 


ek 


The  two  dimensional  eigenfunctions,  associated  with  these 

eigenvalues  <r^,  are  known  to  form  a Riesz  basis  (linear  isomorphic 

2 2 

image  of  an  orthonormal  basis,  see  [66])  for  the  Hilbert  space  L ([0, 1];E  ) 


Each 


("z) 

' W ' 


in  this  space  has  an  expansion 

V _ 

k=-CC 

and  these  are  positive  numbers  d,D  such  that 


(w2)=.£  3k\ 

\W  ' 


rt 

' TAT  • 


zi  l |ck' 

'w“'  L ([0, 1];E  k=-» 


\w  ' 


w“'  L2([0, 1];EZ) 


When  B(x)  = 0 the  e are  all  zero  and  the  c r and  4>  can  be  computed 

K rC  K 

explicitly.  In  the  general  case  we  refer  the  reader  to  the  very  wide 

existing  literature  in  eigenfunction  expansions. 

2 2 

Since  g e L ([0, 1];E  ) it  has  an  expansion 

00 

3 = l 3k  ♦k  • 

k=-<» 
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It  turns  out  that  the  ippropriate  controllability  assumption  is 


gk  * o,  - cc  < k < oo  , 


which  amounts  to  saying  that  the  "control  distribution  function"  g(x) 
has  a non-zero  component  in  the  direction  of  each  of  the  eigenfunctions 

Vx>  • 

We  are  also  going  to  assume,  for  the  sake  of  simplicity  of  present- 
ation, that  the  <r,  are  all  distinct,  which  is  not  implied  by  (4.  9)  except 
k 

in  an  asymptotic  sense.  This  is  not  an  essential  restriction;  the  theory 
to  be  developed  below  can  be  extended  without  difficulty  to  the  case 
where  the  operator  L has  multiple  eigenvalues. 

Let  us  pose  the  control  problem  here  in  terms  of  control  from  the 
zero  initial  state 


/wl  (•  , 0) 

W2(-  , 0) 

to  a prescribed  terminal  state 


wV,T) 

w>,T) 


0C 


k=-«» 


ck*k 


(4.  10) 


(4.  11) 


the  latter  sum  being  the  expansion  of  the  given  terminal  state  in  terms 
of  the  eigenfunctions  of  the  operator  L . It  is  required  to  find  a control 
u € L2[0,  T]  steering  a solution  of  (4.  1),  (4.  2)  from  (4.  10)  to  (4.  11)  during 
[0,  T]  . If  we  expand  this  solution  in  terms  of  the  eigenfunctions  4>  ^ of 
L : 
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the  coefficients 


wk(() 


satisfy  the  scalar  linear  nonhomogeneous  equations 


dw, 

k 

dt 


'k  wk  * h u|t)- 


- 30  < k < , 


(4-  13) 


w (0)  = 0 , -20  < k < x . 

Consequently  for  any  T > 0 we  have 

T o-  (T-t) 

w^(T)  - f e u(t)dt,  - so  < k < =o  . (4.14) 

0 

Comparing  (4.  11)  and  (4.  14),  our  control  objective  is  satisfied  if  and  only 
if 


= /T  e<Tk(2"t)  gkU(t)dt> 


- 20  < k < 30 


(4. 15) 


The  set  of  equations  (4.  15)  constitutes  a moment  problem  for  the 

2 

unknown  control  function  u e L [0,  T]  . Such  moment  problems  have  a 

long  and  distinguished  mathematical  history.  When  a = 0,  = 0 , 

o-  t k^t 

-oo  < k < ao,  the  functions  e = e are  the  well  studied  functions 

of  the  Fourier  series.  It  is  familiar  that  these  functions  are  linearly 
2 

dependent  in  L [0,  T]  if  T < 2 (but,  nevertheless,  span  the  space). 

2 

If  T > 2 these  functions  fail  to  span  L [0,  T]  but  are  independent. 
Precisely  for  T = 2 we  ha.e  both  completeness  and  independence; 
indeed,  the  functions  ^1/V2  je  * form  an  orthonormal  basis  for  L [0,2]  . 
Classical  results  in  harmonic  analysis  state  that  the  "non-harmonic 

V 

Fourier  series",  based  on  the  functions  e (cf.  (4.9)),  behave  in 
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much  the  same  way  as  the  usual  Fourier  series,  based  on  the  functions 

k-n-it  cr^t  ^ 

eKTT  . The  e ' span  L^[0,T],  but  are  not  independent,  if  T > 2 . 
When  T = 2 they  form  a Riesz  basis  for  L^[0,  2]  . What  this  means 


(previously  stated  for  the  eigenfunctions  <j>  of  L ) is  that  there  is 

k 

2 

a bounded  and  boundedly  invertible  linear  transformation  F:  L [0,2] 


Lu[0,  2]  such  that 


r,  k*K  k 

F(e  ) = e 


30  < k < 20 


# -1 

The  operator  (F  ) has  some  significance.  If  we  set 


we  have 


, # -1  kiT  i • , , , „ 

(F  ) (e  ) = Pk,  -20  < k < =0 


<e  , PJ 


L [0,2] 


= (F(ek7Tl‘),  (fY1^*1’))  2 


L [0,2] 


= (ek^i-  e^i, 


2 f 

L [0,2] 


This  is  summarized  by  saying  that  the  p^  form  a dual  Riesz  basis  for 

V 2 

the  e in  L [0,2]  . This  result,  which  has  been  known  since  the 
work  of  Paley  and  Wiener  [7  3]  in  the  1930's,  has  been  improved  in 
various  ways  by,  e.  g.  , Levinson  [ 56],  Schwartz  [99],  Kadec  [46],  and 
many  others.  A more  complete  list  of  references  is  presented  in  [87]. 

A very  easy  proof,  requiring  little  background,  appears  in  [84].  For  the 
most  part  the  theory  makes  essential  use  of  the  Fourier  transform  and 
very  technical  estimates  on  the  infinite  product 
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k- 


(1  --5-)  IT  [d  -^)d  --5->] 


0 k=l 


-k 


z 

(if  any  a,  = 0 the  corresponding  term  1 - — is  replaced  by  z)  and 

K O'  | 

k 

the  associated  products  obtained  by  eliminating  one  factor  at  a time. 

This  approach  requires  a = 0 but  does  not  at  all  require  that  the 

in  (4.  9)  be  square  summable.  It  is  enough  to  have  le,  | < p < t for 

k — 4 

all  sufficiently  large  k . The  result  can  trivially  be  extended  to  a * 0 

(kiri+ek)t;  at 

just  by  multiplying  each  of  the  functions  e by  e and  each 

(kTri+ek)t  _-t 

of  the  functions  p^  dual  to  e by  e 

With  the  above  background  we  can  immediately  provide  a formal 
solution  for  the  moment  problem  (4.  15)  when  T = 2,  i.  e.  , for 


2 *r  (2-t) 

c - f e g u(t)dt,  -°c  < k < oo  . 

k o k 

For  convenience  we  redefine  the  p^  (t)  by  p^  (2 -t) ) so  that 


(4.  16) 


f2  'kV-V  j 

/ e P.(t)dt  = 6 


(4.17) 


Then 


l W’V*’ 

f=-CC 


(4.18) 


solves  (4.  16)  in  a formal  sense.  We  have  a bona  fide  solution 
2r 

u « L [0,  2]  just  in  case 


l 

( = -00 


< 00 


(4.  19) 
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(Because  (4.18)  is  the  image  under  (r  ) of  ( — ) e ) • The 
cr  t f =-»  q( 

k 2. 

fact  that  the  e are  linearly  dependent  in  L [0,  T]  for  T < 2 shows 

that  (4.  15)  cannot  be  solved  in  general,  even  formally,  if  T < 2 . The 

2 

fact  that  the  e are  linearly  independent,  but  fail  to  span  L [0,  T], 

when  T > 2,  implies  that  (4.  15)  can  be  solved,  but  not  uniquely,  when 

T > 2 . We  see  then  that  our  control  problem  (4.  1),  (4.  2),  (4.  10),  (4.  11) 

has  a unique  solution  in  L~[0,  2]  just  in  case  T = 2 and  (4.  19)  holds. 

2 

Referring  to  the  general  abstract  framework  of  Section  2,  with  Y = L [0,  2] 
2 2 

Z = L ([0,  2];  E ),  the  operator  C is  defined  by 


oo  2 <r,  (2-t) 


k=-oc  0 


g}  u(t)dt)c>k 


(4.  20) 


2 2 2 
and  maps  L [0,2]  into  a dense  subspace  G C L ( [ 0 , 2 ] ; E ) consisting 

00 

of  secuences  Y c,  <j>,  for  which  (4.  19)  is  true.  Taking  X also  to 
. *->  k k 

2 k=_0° 

be  L ([0,2];  E2),  if  we  choose  S = I,  [X,  Y,  Z,  S,  C)  is  approximately 
controllable.  On  the  other  hand  if  we  let  S be  defined  by  S = g,  , 

-so  < k <x),  then  [X,  Y,  Z,  S,  C}  is  (exactly)  controllable. 


If  we  consider  instead  of  (4.  1),  (4.  2)  the  boundary  control  system 


o)  iri + bKwz)  ■ 

' ' ' ' TAT  ' TA7 


(4.21) 


aQ  w (0,  t)  + bQw  (0,  t)  = 0 , 


(4.22) 


1 2 

a w (1,  t)  + b.  w (1,  t)  = u(t) 


(4.  23) 


the  results  of  Theorem  3.2  show  the  system  to  be  uniquely  (since  there 


I 


are  in  this  case  no  intervals  I on  which  arbitrary  data  need  be  specified) 
controllable  just  in  case  T = 2 . That  this  time  coincides  with  that 
developed  above  is,  of  course,  hardly  a mere  coincidence.  One  can 
show  (see  [90]  for  details)  that  the  control  problem  (4.  10),  (4.  11),  posed 
for  (4.21),  (4.22),  (4.23)  is  equivalent  to  a moment  problem 


2 <r  (2-t)  ^ 

: = f e g,  u(t)dt,  ~x  < k < * 

k o < 


(4.24) 


with 


ra~1  4>jj(l),  at  * 0 , 


bl 1 bl 


* 0 . 


Analysis  of  the  differential  equation 

1 /n  rs  / 1 » 


■a -{-ua 


satisfied  by  <t>,  enables  one  to  see  that  there  are  positive  numbers  y 

K U 


and  such  that 


Y0  1 l9kl  £ Yj,  -x  < k <oo  . 


In  this  case,  therefore,  the  range  space  G of 

2 o-  (2-t)  a 

C u = V ( f eK  gku(t)dt)4>^ 
k--oc  0 


(4.25) 


2 2 

is  precisely  L ([0,2];  E ),  just  as  we  would  have  from  Theorem  3.2, 
and  {X,  Y,  Z,  S,  C } is  exactly  controllable  with  S = I . In  fact,  this 
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route,  Theorem  3.  2 followed  by  the  observation  that  solution  of  (4.  24)  is 
equivalent  to  solution  of  the  control  problem,  provides  another  way  of 
establishing  the  existence  of  biorthogonal  functions  p^  satisfying  (4.17); 
the  functions  pf  are  precisely  the  controls  steering  the  zero  initial 
state  to  the  final  state 


Thus  the  relationship  between  control  theory  of  hyperbolic  systems  and 
harmonic  analysis  is  a two-way  affair.  One  can  begin  with  results  in 
either  field  and  infer  corresponding  results  in  the  other.  In  Section  8 
this  program  will  be  carried  out  in  connection  with  a more  complex 
control  situation. 

Not  all  control  problems  for  the  scalar  wave  equation  (4.4)  can  be 

treated  in  the  context  of  (4.  1).  If  we  take  (4.4)  with  boundary  conditions 

w( 0,  t)  = w(l,t)  = 0,  for  example,  the  resulting  moment  problems  involve 

only  e^^,  k = ±1,  ±2,  . . . ; the  function  1 is  missing.  Control  is 

not  then  unique  even  for  T = 2 . If  we  take  a^  = a^  = 0,  b^  = b^  = 1 

in  (4.  5)  we  have  a multiple  zero  eigenvalue  and  the  moment  problems 

involve  the  functions  1,  t,  e^71^,  k = ±1,  ±2,  ...  . These  functions  are 

not  independent  in  L [0,2]  and  control  is  possible  only  if  T > 2 . (The 

eigenstate  w = 1,  - 0 is  suppressed  in  passing  from  (4.4)  to  (4.1) 

via  _ — w2  - . ) These  and  other  problems  of  the  same  sort 

8t  9x 

are  discussed  in  [87].  There  are  a large  number  of  open  questions 
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associated  with  more  complex  systems  wherein  the  eigenvalues  of  the 
generator  of  the  semigroup  occur  as  several  sequences  of  the  type  (4.  9) 
but  with  various  asymptotic  intervals  between  the  members  of  the  individual 
sequences  - or  the  eigenvalues  may  occur  in  clusters  with  the  clusters 
themselves  having  an  asymptotic  separation.  Some  rather  nice  theoretical 
results  have  been  obtained  along  these  lines  by  Ulrich  [107]. 

We  have  noted  earlier,  in  Theorem  2.  9,  that  if  the  finite  dimensional 
linear  control  system  (2.1)  is  controllable  then  not  only  is  it  stabilizable 
but  the  eigenvalues  of  the  closed  loop  matrix  can  be  selected  at  will  by 
appropriate  choice  of  the  feedback  matrix  K . We  are  going  to  complete 
this  section  by  describing  a comparable  result  for  the  system  (4.  1),  (4.  2) 
(or,  equivalently,  for  (4.  21) -(4.  2 3)).  To  motivate  the  procedures  which 
we  will  use,  we  briefly  review  the  proof  of  the  "spectral  assignment" 
property  claimed  in  Theorem  2.  9.  We  do  this  only  for  the  case  of  a 
scalar  control  u;  the  case  u e Em  is  fully  treated  in  [6],  [111]. 

Thus  we  consider  the  system 

x = Ax  + bu,  x € En,  u e (4. 26) 

and  the  closed  loop  systems 

x = (A  + bk  )x  (4.  27) 

obtainable  from  (4.  26)  with  use  of  linear  feedback  relations 

u = k x,  k € En  . (4.  28) 

We  assume  (4.26)  to  be  controllable.  Then 
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C = (An_1b,  An"Zb,  . . . , Ab,  b) 


is  a nonsingular  n Xn  matrix.  It  bears  a definite  relationship  to  the 
operator  C defined  just  prior  to  Definition  2.4,  representing,  as  it 
does,  a sort  of  "control  to  state"  map.  If  one  considers  distribution 
valued  controls  with  {0}  as  support, 


u = 


6 


n-1 

0 


6 


n-2 

0 


4- 


> 


(6q  is  the  Dirac  "delta  function"  with  { 0 } as  support  and  6Q  is  its 
k-th  derivative  in  the  sense  of  the  theory  of  distributions)  and  assumes 

x(O-)  = 0 


one  may  readily  verify  that 


x(0+! 


n_l  ~ 
= A bt, 


A n-2 
+ A 


bl 


2 


Since  C is  nonsingular  it  can  be  used  to  transform  the  dynamical 
system  (4.  26)  in  E°  into  another  system  in  the  space  E consisting 
of  vectors 


Indeed,  putting 
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x = C E, 


(4.  26)  becomes 


l = c"1  AC;  + C_1bu  s A l + bu 


(4.29) 


Since  the  last  n - 1 columns  of  AC  agree  with  the  first  n - 1 columns 
of  C and  since  b is  the  last  column  of  C,  A and  b must  take  the 
form 


A = 


-al 

1 

0 

0 

o 1 

° \ 

'a2 

0 

0 

0 

0 

° 

• 

b = e = 
n 

• 

-a 

n- 

1° 

0 

0 

1 

0 

-a 

n 

0 

0 

0 

° I 

1 

■ 

1’  a2’  • 

• • ) 

a , a 
n-1  n 

are 

such  that 

(4. 30) 


Anb  + a.  An_1  b + . . . + a Ab  + a b = 0 
1 n-1  n 

(and  hence  are  by  the  Cayley-Hamilton  theorem,  the  coefficients  of  the 
characteristic  polynomial  of  A (and  A)).  The  equation  (4.  30)  constitutes 
the  control  normal  form  for  the  system  (4.26). 


The  next  step  is  not  particularly  easy  to  motivate  a priori  . We  let 
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<t>  = 


0 

0 

'l 

1 

0 

‘2 

al 

1 

n-2 

an-3 

an-4 

n-1 

an-2 

an-3 

0 \ 

0 

0 


(4.  31) 


Evidently  4 is  nonsingular  and  setting 

Z = $ l 


one  may  verify  with  a little  calculation  that  (4.  29)  is  transformed  to 


; = A;  + enu  , 


(4.  32) 


e^,  as  in  (4.  30),  remaining  invariant  and 


0 1 
0 0 


A = 


0 0 


(4.  33) 


-a  -a  -a  _ 
n n-i  n-2 


-a 


The  form  (4.  32)  is  known  as  the  control  canonical  form  for  (4.  26). 

In  (4.  32)  the  effect  of  feedback 

$ * C"1  x)  (4.  34) 


u = k f,  (=  k 


is  immediately  visible.  In  the  closed  loop  system 
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the  matrix  has  the  form 


t = ( A + e k *)  t, 


A 

A + e k = 
n 


1 

0 


-a  + k,  -a  . + k^  ...  -a.  + k 

n 1 n-1  2 1 n 


«<\  yv  ^ 

The  characteristic  polynomial  of  A + e k is 

n 

P(X)  = k + (a^  - k^)\  + . . . + (a  ^ - k^)X  + (a^  - k^) 

and,  since  its  coefficients  can  clearly  be  selected  at  will  by  appropriate 
choice  of  k in  (4.  34)  (and  hence  of  k in  (4.28))  its  zeroes  can  also 
be  selected  at  will,  thereby  providing  a proof  for  Theorem  2.  9 in  the 
case  of  a scalar  control. 

To  give  some  meaning  to  $ , shown  in  (4.  31),  let  us  note  that, 

with 


E = 


0 0 
0 0 

0 1 
1 0 


0 1 
1 0 

0 0 
0 0 


we  have  £ = and  (with  denoting  complex  conjugate) 


A.  * ~ t ~ 

A = Z A 2 = £ A E . 


(4.  35) 
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Suppose  A has  eigenvectors  v^,  v^,  . ..,  and  eigenvalues 

~ * 

p p . . . p . Then  A has  as  its  eigenvectors  the  dual  vectors 
12  n 


w 


i ~ * 

,,  w w for  which  (v,,  w.)  - 6.  and  A w,  = p . w.  . Then 

1’  2’  ’ n i’  J f,n  ] j j J 


I A X IS  v.)  ^ L A v --  p Z v.  . 

J i i ) 


(4.  36) 


Combining  (4.  35)  and  (4.  36)  we  see  that 


A Z v.  = p.  Z v. 

] J J 


_ Ak 

and  we  conclude  that  Z v is  an  eigenvector  of  A corresponding  to 


the  eigenvalue  p . But,  since 


# ~ * # -1 

A = $ A ($  ) 


the  eigenvectors  of  A have  the  form  $ w^,  where  the  w^  are  the 

^ jJj 

eigenvectors  of  A . Hence 


<t>  w.  = (3 . Z v. 

J J 


(4.  37) 


for  some  p,  * 0,  i.  e.  , $ takes  the  vector  w.  of  the  dual  basis 

(w^,  W2’  • • • > w ) into  a scalar  multiple  of  the  "order  reversed"  complex 

conjugate  of  the  corresponding  element  v of  the  original  basis 

(v,,  v„ v ) . This  will  be  significant  in  a later  comparison. 

1’  2 n 

Our  next  objective  is  to  develop,  for  (4.1),  (4.2),  a control  canon- 
ical form  analogous  to  (4.  32),  (4.  33)  and  playing  a comparable  role  with 
respect  to  spectral  determination  via  feedback.  Let  w (t),  -oo  < k.  < oo  , 
denote  a solution  sequence  of  (4.13),  i.e.  , 
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dw^ 

— = 't  wk + h u(t) 

lying  in  G (cf.  (4.20)  ff)  for  all  t - which  is  true  if  it  lies  in  G for 
any  t . We  use  the  control-to- state  map  C exhibited  in  (4.20)  to  de- 
fine a new  variable  £ = t,  (t,  t)  : 

2 or  (2-t) 

wk(t)  = (C  C (t,  - ))k(t)  = / e gkUt,T)dr.  (4.38) 

The  inverse  map,  from  (4.18),  is 

30  w (t)  

ut,  T)  = l — P.  CO  . (4.39) 

k=-»  yk 

Our  task  now  will  be  to  see  how  the  dynamical  equations  (4.  13) 
for  the  w (t)  are  transformed  into  dynamical  equations  for  £,  (t,  t) 
under  the  above  transformation.  Before  doing  so,  however,  it  will  be 
useful  to  consider  a certain  relationship  which  is  satisfied  by  the  ex- 

V 

ponential  functions  e . If  we  had  o\  = a + kiri  we  would  have 

k 

cr.  2 <r , 0 

k (a+kTri)2  2a,  a k , 

e =e  =e(-ee  ) 

and  every  linear  combination 

cr,  t 

r(t)  * E rk e 

k k 

would  likewise  satisfy 

2a 

r(2)  = e r(0)  . 

Let  us  form  the  function 
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I 


P(t) 


30 

= V 

k=^3o 


o\  2 - 

k 2a 

(e  - e ) p (t)  . 


(4.41) 


1 


a-  2 (o+kiTi+  © ( — ))2 
Since  e = e 


e e 


A . 2a.,  1 1 

(k)l  = e 


k“  vs  x k //-  Za  © (^  ) - e (l+$(-))  = e + ©(r) 


2 (yk{Z") 

series  (4.41)  converges  and  pt  L f 0,  2 ] . Since  (e 


the 


2 Tk(2-T) 

J e 


2 o , 2 
(< 

0 


o.  (2--r) 


— ; — r , r , k 2a  k _ 

p(T)dr  - J (e  -e  ) e pi  ( T)d t = e - e 


’Pi)L2[0,  2]  ’ 

V 2a 


Each  linear  combination  (4.40)  with  o,  = o + kiri  + © (t-)  consequently 

K K 

satisfies 


r(2)  = e2°  r(0)  + / p(2-r)  r(r)dr 
‘ 0 


To  find  the  dynamical  equation  satisfied  by  £ (t,  t)  we  differentiate 
(4.  39)  formally  and  then  use  (4.  13)  and  (4.  38); 


dw. 


St,  (t,  t)  Y1  dt 

9t  Ti  gk  Pk(T)  = 

k=-» 


so  or  w(t)+g  u(t) 

V k k k — — 

L g Pk(T>  = 

k — - 30  9k  K 


30  2 o^(2-t)  __  00 

it  f e ' C (t,  T)dr  p (t)  + u(t)  V p (t) 

k=-oo  * 0 k k=-30  k 


= (integrating  by  parts) 


30  2 cr  (2-t)  ~ 

V (fek  d^(t}  T) 

k=  -30  0 


30  Zar 

V --  k ~ 


4 2j  (e  Ut,0)  - Ut,2)  + u ( t ) ) pk(r) 
k=-ao 


(4.42) 
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2a 


Adding  and  subtracting  e t,  (t,  0),  the  second  sum  above  becomes 
00  2<r. 


Vk  2 o ~ ' — 

(e  - e ) t,  (t,  0)  pk(T) 

k=-oo 


(4.43) 


+ Tj  (e2°  Ut,  0)  - Ut,  2)  + u(t))  p (r)  . 

k=  -<* 


oc 


The  second  sum  here  is  just  a multiple  of  ^ p ( -r)  which  can  be 

k=-°o  k 


shown  to  be  the  distribution  6(2 -t)  . We  eliminate  it  by  requiring 


1 (t,  2)  = e2°  ;(t,  0)  + u(t) 


(4.44) 


From  (4.  41)  the  first  term  in  (4.  43)  is  seen  to  be  just  p(r)  t,  (t,  0)  . 
now  (4.42)  gives 


So 


H(t,  t) 

at 


2 a,  (2  -t)  . ~ 

</  * 


l 

k=  -oc 


0 


pk(T» 


+ P(T)  t,(t,  0)  . 


But  the  sum  above  is  just  (see  (4.  38)  and  (4.  39))  C-*  C(t^  ) = Tp-  . So 

9t  or 


we  have 


or  or  ~ 

af  = g7  + P(T)  Ut,  0)  . (4.45) 

The  equations  (4.4  5)  and  (4.44)  constitute  the  control  normal  form  for 
(4.  13)  (hence  (4.  1),  (4.  2)  and  should  be  compared  with  (4.  29),  (4.  30). 

The  above  derivation  is  entirely  formal.  In  [95]  it  is  justified  by 
making  use  of  the  observation  normal  form  for  the  dual  observed  system. 
To  pass  to  the  control  canonical  form  we  employ  a transformation 
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C,  = <t>  C,  of  "convolution  type”  which  may  be  regarded  as  a distributed 
analog  of  (4.  31): 


Ut,  T)  = (<U(t,  *))(T)  C(t,T )-[  P(  r-cr)  4(t,  tr  )dcr  . 

0 


(4.46) 


Substituting  this  expression  in  (4.44)  we  have 


2 


Ut,  2)  - f p(Z-cr)  t,(t,  o-)  d<r  = e^'  U4  0)  + u(t) 

' 0 

whence,  renaming  the  variable  of  integration, 

2 2 

Ut,  2)  = e ° Ut,  0)  + / P(2-t)  ;(t,  T)dT  + u(t)  . 

0 


(4.47) 


To  see  how  (4.4  5)  transforms  we  observe  that 


r &Ut,  r)  _ H(t,  t) 


9t 


- PC’)  4 (t,  0) 


= -jfT  (Ut,  T)  - / p(T-o-)  Ut,cr)do-) 


- (4(t,  t)  . f p(t-<t)  ut,<r)do -)  - p(r)  ;(t,  0) 

‘ 0 

- ^ - f ^ ««.*> 


I 


0 


^ P( T -cr) 
0t 


- Ut,ar)dor  - P(T)  4(t,  0)  • 


Noting  that 


9p(T-o-)  9 p(  T -cr) 


9t  9(r 

integral  above  we  obtain 


and  integrating  by  parts  in  the  second 


from  which  we  conclude  that 
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aut.  -) 


at 


^(t,  t) 

Ot 


0 


4.48) 


Now  (4.48)  shows  that  t;  is  constant  on  lines  t + r = c (constant). 
We  may  therefore  write  4 as  t,(t  + t)  and  (4.47)  becomes 


C(t+2)  = e“°  ?,(t)  + f P(2-t)  t,(t  + T)dT  + u(t)  . 

0 


(4.49) 


This  is  a neutral  delay  equation  which  constitutes  the  control  canonical 


form  for  the  system  (4.  13)  (hence  (4.  1)  . (4.2))  . The  distributed  part  of 


cr,  2 

. k 2 a 

the  delay  vanishes  when  B(x)  = 0 in  (4.  1)  since  e = e in  that 


case  for  all  k and  p (cf.  (4.41))  is  thus  identically  equal  to  zero. 

This  canonical  form  should  be  compared  with  (4.  32),  (4.  33)  of  which  it 
is  a distributed  analog. 

Just  as  the  control  canonical  form  for  finite  dimensional  systems 
enables  one  to  see  that  eigenvalues  of  the  closed  'oop  system  can  be 

jjj 

placed  at  will  by  appropriate  choice  of  the  feedback  vector  k , the 
canonical  form  just  obtained  can  be  used  in  a similar  way  with  respect  to 
the  system  (4.13)  (or,  equivalently,  (4  1),  (4.2)). 

Let  us  suppose  that  we  determine  the  control  u(t)  by  means  of  the 
linear  feedback  law 

1. 


u(t)  = 


. k 


(4.  50) 


w (•  , t) 


L2([0, 1 j;E2) 


where 


1 


k = \ ' « L ([0,  1 ]; E ),  k = V k 4> 

k I J=-»  1 1 
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L..  , . 


The  closed  loop  system  thus  obtained  involves  addition  of  a "dyadic 

1 


,w 


operator"  to  the  generator  L(  (cf.  (4.  8)): 


w 


„ 1 1 , - 1 1 1 

d w w 019w  w w 

5T<  2|lLk<  2'  = <1  o’ta'  2>+B,X><  2>+0(x)«  2),  U.  (4.51) 

W W WWW 


Our  next  objective  is  to  indicate  how  this  closed  loop  system  transforms 
under  the  action  of  the  mappings  C and  4>  above  (cf.  (4.  38),  (4.46)). 

In  terms  of  the  variables  present  in  the  system  (4.  13)  it  is  clear 
that  the  feedback  law  (4.  50)  must  take  the  form 


u(t)  = V k.  w.(t) 

. -1  ) j 

]-_OC 

and,  by  virtue  of  (4.  38),  in  terms  of  the  variable  t,  of  the  control  normal 
form  we  have 

oc  2 cr.(2-  t) 

u(t)  = V k g.  f e 1 Z (t,  T)dr  . 

j = -oo  ] ] 0 

From  (4.46)  we  then  have 

oo  2 o-.(  2-t) 

u(t)  = V k.  g,  f e 3 & C(t,  T)dr 

j^oo  3 J 0 

oo  2 (r . ( 2 - • ) 

= 7,  k g f (4>'e  3 )(t)  ;(t,  T)dT  . 

j — — oo  ] } o 

The  following  proposition  is  perhaps  the  most  important  and  most  diffi- 
cult of  the  whole  theory.  We  refer  the  reader  to  [9  5]  or  [96]  for  (two 
different)  proofs  and  more  details. 
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<r(2--)  

Proposition  -4.  1.  ($  e )(t)  = p^  p.(2-t)  where  p.  is  the 

biothogonal  function  defined  by  (4 . 17 ) and  0 < b < p < B,  - * < j < r . 

The  result  stated  in  this  proposition  is  precisely  the  analog  of 

the  formula  (4.  37)  developed  earlier  for  the  finite  dimensional  case.  In 

addition  this  result  shows  that  the  map  <f>  is,  except  for  a trivial  change 

of  independent  variable,  the  map  used  by  N.  Bari  in  her  treatment  of 

Riesz  bases  (see  discussion  and  reference  in  [66]). 

We  finally  have,  then, 

20  2 

u(t)  = ^ k g.  f p p (2-t)  4(t,  T)dT 

j^-oo  J ^ o 3 j 

2 

= f k (2-t)  t,(t,  T)d t (4.  52) 

0 g 

where 

00 

k (t)  = Y k g p P (2-  r)  . (4.  53) 

y . j j j j 

J ” —20 

Consequently,  in  terms  of  the  canonical  variable  the  closed  loop 
system  (4.  51)  becomes  (cf.  (4.  49)) 

Ut+2)  = e2u  ;(t)  + f (P(2-t)  + k (2-t))  ;(t-t-T)dr  . (4.54) 

0 g 

We  see,  therefore,  that  the  control  canonical  form  has  the  property  of 
being  invariant  under  feedback.  A feedback  relation  (4.  50)  just  modifies 
the  kernel  p(2-t)  in  the  distributed  delay  term.  It  is  this  invariance 
property  which  enables  us  to  study  how  the  spectrum  of  the  closed  loop 
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system  (4.  51)  can  be  modified  when  u obeys  a feedback  law  (4.  50). 

The  main  result  is 

Theorem  4.  2.  Let  distinct  complex  numbers  p.,  -»  < j < oe)  be  selected 

with  the  property 


00  o- . - P.  2 
V J l 


!4.  55) 


Then  there  is  a feedback  vector  function  k « [0,  1]  for  which  the 


closed  loop  system  (4.  51)  has  precisely  the  eigenvalues  p 


Sketch  of  Proof.  Consider  a system  (cf.  (4.  13)) 


_ = p.  W.,  -oo  < j < oo 


having  the  desired  p^.  as  eigenvalues.  Reducing  this  system  to  canonical 
form  one  obtains  (cf.  (4.49)) 


t,(t  + 2)  = e C'  ;(t)  + f q(2 - t)  £(t  + T)dT 

0 


where  (cf.  (4.41),  (4.17)) 


00  PiZ 

V7  J C*Q 

q(T)  = (e  - e ) q.(T)  , 

j = -00 


2 Pj(2-r) k 

f e q (t)  dr  = 6 . 

0 k J 


To  realize  the  p^.  as  eigenvalues  of  the  closed  loop  system  (4.  51)  it  is 
necessary  and  sufficient  that 


p(2-  r)  + kgU-T)  = q(2-T) 


Referring  to  (4.  52)  we  see  that  the  question  is  whether  or  not  there  is  a 

°°  2 

sequence  { k . } with  Y |k  1 < oo  such  that 


Z kj  9j  Pj(2"T)  = q(2"T)  ' p<2-t) 


With  a rather  simple  calculation,  which  just  involves  the  estimation  of 
the  expansion  coefficients  of  q(2-T)  with  respect  to  the  functions 
Pj (2 - t)  it  is  shown  in  [95]  that  this  can  be  done  if  the  inequality  (4.  55) 
is  satisfied.  The  calculation  does  not  show  that  (4.  55)  is  necessary  but 
it  does  indicate  that  no  significant  improvement  is  likely. 

We  have  noted  that  in  the  case  of  boundary  control  ((4.  21),  (4.  22), 
(4.23))  the  g^  (cf.  (4.24))  are  bounded  and  bounded  away  from  zero. 

It  can  be  shown  in  this  case  that,  by  utilization  of  feedback  relationships 
of  the  form  (4.  50),  one  can  realize  values  p.,  -<*  < j < °o,  for  the  closed 

loop  system  provided 


Z k - Pjl  < 00  • 

\-  -00 


(4.  56) 


In  both  cases,  (4.  5 5)  and  (4.  56),  any  finite  subcollection  of  the  points 
of  the  spectrum  can  be  moved  at  will.  Indeed,  the  whole  spectrum  can 


be  moved  "at  will"  in  a sense.  If  one  selects  a sequence  of  disc 

oo  r.  2 
v1  1 <oe 

neighborhoods  N.  of  radius  r.  centered  at  tr,  with  ) — 

„ 2 « 1 1 sj 

(V  | r | < oc  in  the  boundary  control  case)  then  the  p,  can  be  placed 

• ^ j 3 

j - .00  J 

at  will  in  the  neighborhoods  N.  . 
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If,  in  addition  to  having  boundary  control,  as  in  (4.21),  (4.22), 

(4.2  3),  one  likewise  permits  boundary  feedback,  e.  g. 

1 2 

u(t)  = a w (1,  t)  + (3  w (1,  t)  + ((  ),  k) 

W L([0,1];E2) 

We  then  obtain,  as  the  closed  loop  system,  (4.21), 

1 2 

(a  -a  ) w (1,  t)  + (b  -(3  ) w (1,  t)  = ((  ),  k) 

w L ([0, 1];E^)  (4.57) 

In  this  case  one  can  change  the  "base  point"  a (cf.  (4.  9))  of  the 
spectrum  to  any  other  desired  base  point  p and  (4.  56)  is  replaced  by 
00 

T.  l(o-  - p ) - (a  - (3)  |2  < oo  . 
j = .oo  J 1 

Thus  the  asymptotic  vertical  line  along  which  the  spectrum  is  located 
can  be  moved  and  then  square  summable  changes  in  the  c r.  can  be 
carried  out  as  well. 

One  of  the  outstanding  problems  remaining  in  this  area  is  to  extend 
the  notion  of  control  canonical  form  from  the  very  special  class  of  dis- 
tributed systems  considered  here  to,  first  of  all,  higher  dimensional  hyper  - 
bolic  systems  and  parabolic  systems  and,  secondly,  but  even  more 
fundamentally,  to  a general  abstract  framework  which  will  include  a large 
class  of  linear  distributed  parameter  systems.  While  very  little  can  be 
said  now  about  such  a project  it  is  safe  to  predict  that  the  "control  to 
state  map"  C and  the  map  $ , which  we  have  seen  relates  a Riesz 
basis  to  its  dual  basis,  will  remain  important  in  the  theory. 
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5. 


BOUNDARY  CONTROLLABILITY  THEORY  FOR  HIGHER  DIMENSIONAL 
HYPERBOLIC  SYSTEMS 


In  contrast  with  the  relatively  complete  theory  which  we  have  seen 
to  exist  for  hyperbolic  equations  involving  only  a single  space  variable 
x,  confined  to  an  interval  0 < x < 1,  the  control  and  observation  theory 
for  processes  taking  place  in  multi-dimensional  regions  is  quite  primitive. 
This  is  due  to  the  fact  that  the  characteristic  surfaces  arising  in  such 
problems  are  nowhere  near  as  constructively  useful  as  in  the  one  dimen- 
sional case.  Nevertheless  such  theory  as  does  exist  does  not  lack 
interest,  as  we  shall  point  out. 

Virtually  the  only  multidimensional  hyperbolic  control  process 
which  has  received  any  attention  is  the  scalar  wave  equation,  which  we 
shall  write  here  in  the  form 

3^w  9 i 9w 

p(x)  — y - \ — T-(o.(x)  — Y ) = 0 , (5.1) 

9t  i,  j = l 9x  J dx 

2 

where  p(x)  a (x)  are  real  valued  functions  of  class  n in  the  closure 
ij 

of  fi,  a bounded  open  connected  set  in  Rn  with  piecewise  smooth 
(plus  other  conditions  as  required  for  existence,  uniqueness,  etc.  ) 
boundary  r . It  is  assumed  that 


P(x)  > PQ  > 0,  X € fi  , 

(5.2) 

(w,  A(x)w)  > a ||w||  n>  «o  > 0 ’ 

(5.  3) 

E E 

where  A(x)  is  the 

n X n symmetric  matrix  with  entries  a*(x)  . 

The 

condition  (5.  3)  is, 

of  course,  the  familiar  "uniform  ellipticity" 

condition 
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We  denote  by  v(x)  the  (almost  everywhere  defined  and  unique) 


unit  exterior  normal  to  r = 9f2  at  x e r . We  assume  r divided  into 
two  parts 

r = ro  Uri’  r0  n ri  = * ’ (5-  '}) 

with  Fj  non-empty  and  relatively  open  in  r . Also,  unless  explicitly 
stated  to  the  contrary,  we  will  assume  to  have  a non-empty  in- 

terior as  well. 

The  idea  now  is  that  control  will  be  exercised  on  while  only 
natural,  passive  constraints,  in  the  form  of  boundary  conditions,  will 
act  on  rQ  . This  still  leaves  a number  of  possibilities  for  different 
types  of  control  application  and  different  types  of  constraints.  To 
avoid  inessential  complication  in  this  expository  treatment  we  shall  con- 
sider only  solutions  w = w(x,  t)  of  (5.  1)  for  which 

w(x,  t)  a 0,  x e rQ  (5.  5) 

(v(x),  A(x)^(xft))  n = u(x,  t),  x € Tj  (5.6) 

2 

u being  a scalar  control  function  restricted  to  lie  in  L (r  X[0,  T])  for 
any  fixed  T > 0 . Thus,  again,  we  are  concerned  with  a "boundary 
control" . 

It  is  by  now  an  almost  classical  result  that  if  one  gives  initial  data 

w(x,  0)  = wQ(x),  x c n,  wQ  € Hm(f2)  (5.7) 

9w  m-1 

— (x,  0)  = vQ(x),  x £ O,  vQ  £ H (fi)  , (5.  8) 
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(where  H m(S2)  is  the  usual  Sobolev  space  [ 1 ])  and  if  appropriate  consistency 

conditions  relating  w^,  to  the  boundary  conditions  (5.  5),  (5.  6)  are 

met,  then  (5.  1),  (5.5),  (5.6),  (5.7),  (5.  8)  has  a unique  generalized 

m m 1 

solution  w € c ([0,  oo);  H (ft)  © H (ft))  in  the  case  where  u(x,  t)  h 0 . 
When  a simple  matter  to  relate  regularity  of  u to  regularity  of  the  solu- 
tion w . An  extensive  treatment  is  presented  in  [58],  [59].  A rather 
simple  and  explicit  (negative)  example  is  provided  by  Graham  in  [3  3]  for 

the  case  where  ft  is  the  ball  in  Rn,  p(x)  h l?  A(x)  = 1,  = r = 3ft  , 

2 

rQ  = . There  it  is  shown  that  controls  u t L (r  © [0,  T])  are  capable 

9w 

of  producing  states  (w(-,T),  v(-,T)  ( = -^-(-,T)))  which  are  not  "finite 
energy"  states,  :.  e.  do  not  lie  in  H^(ft)  © H°(ft)  . This  "excessive" 
effect  of  controls  u might  seem  to  promise  rather  strong  controllability 
results  but,  in  fact,  it  leads  to  considerable  difficulty  in  some  theorems, 
as  we  shall  point  out  in  Section  6. 

The  basic  "approximate  controllability"  result  appears  in  [88],  [89]  . 
The  approach  was  suggested  to  the  author  by  J.  L.  Lions  in  a private 
correspondence  and  takes  the  same  form  as  a comparable  result  for  the 
heat  equation  proved  in  [57].  (More  on  this  in  the  next  section  which 
treats  the  heat  equation).  As  is  usually  the  case  with  theorems  of  this 
type,  it  is  proved  in  the  context  of  the  duality  structure  discussed  in 
Section  2. 

Referring  again  to  Figure  2.1  and  the  preceding  material  in  Section  2, 

we  let 
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A 


X = Z = {fw,  v)  e h\  ii)  © H°(J2)  |w(x)  = 0,  x « rQ  } (5.9) 

with  the  Hilbert  space  structure  induced  by  h\s2)  © H°(S2)  . The  map 
S:  X — Z is  defined  for  a fixed  T > 0 by 

S(wQ,  vQ)  = (w  (•  , T),  v (• , T))  , (5.10) 

w (• , T),  v (•  , T)  (=  (■  , T))  being  the  state  achieved  at  time  t = T by 

the  generalize  solution  w,  and  its  time  derivative  determined  by  (5.  1), 
(5.  5),  (5.  6)  (with  u = 0),  (5.  7)  and  (5.  8)  (with  m = 1 ).  We  let 

Y = lVj  X[0,  T]) 

and  define 

Cu  = (w  (•  , T),  v (• , T))  (5.11) 

where  w is  the  solution  of  (5.  1),  (5.  5),  (5.  6)  with 

A 

w(-,0)  = v(-,0)  (h-^(-,0))  = 0 . (5.12) 

A A 

We  have  remarked  that  we  cannot  be  certain  that  (w(-  , T),  v(-  , T))  c Z 
2 

for  arbitrary  u e L (r^  X [0,  T ])  . Hence  C is,  in  general,  an  unbounded 

operator,  as  anticipated  in  Section  2.  We  take  the  domain  of  C,  fi(C)  , 

2 

to  be  the  subspace  of  Y = L (r}  X [0,  T])  consisting  of  u for  which 
w (•  , T),  v(-,T)  does,  indeed,  lie  in  Z,  as  defined  by  ( 5.  9)  (note 
that  S and  C are  constructed  just  as  in  Section  3,  but  in  that  section 
C was  a bounded  operator).  We  remark  that  the  fact  that  &(C)  is  dense 
in  Y follows  from  the  fact  (see  [88],  [59])  that  S(C)  contains  all  func- 
tions u € 2 ^ ( Tj  X [0,  T])  whose  support  is  a closed  subset  of  the  interior 

of  ri  X [0,  T]  . 
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Now  let  y = y(x,  t)  be  a solution  in  2 of 


2 n . . . 

9 y v1  o 1 , ay  , 

p<*>  -r  - I — «yx>  ~t  > = 0 

9t  i,  j = 1 9x  9x 


satisfying  boundary  conditions 


y(x, t)  = 0,  x e r 


(5.14) 


(v(x),  A(x)  -f  (x,  t))  n = 0,  x e r 


(5. 15) 


We  introduce  the  energy  form  on  X = Z : 

rZ  dw  3 w 

[ p(x)  v(x,  t)  + {—  (X,  t),  A(x)  t)  n]dx 

n e 


(5. 16) 


and  the  related  "energy  inner  product" 


<(w(- , t),  v(- , t));  (y( * , t),  z(-,t)))E 

;9w  9v  i , 

[ P ( x)  v(x,t)  z(x,t)  + (— (x,t),  A(x)— (x,t))  n]dx 

Q E 

, 9w  9y 

for  which  \l  2E  is  the  associated  norm.  Letting  w,  v = — , y,  z = — 

satisfy  (5. 1),  (5.5),  (5.  6)  and  (5.  13),  (5.14),  (5.15),  respectively,  a 

formal  computation  (which  can  be  justified,  using,  e.g.  , the  methods  in 

[59])  shows  that 

<(W(-,T),  v(.,T));  (y(*,T),  z(.,T))>  - <(w(-,0),  v(-,0));  (y(- , 0),  z(- , 0))>£ 


= J J z(x,  t)  u(x,  t)ds  dt 

0 r. 


(5.17) 
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where  ds  is  the  element  of  surface  area  on  r . 

If  we  now  replace  (w,  v)  by  (w,  v),  the  solution  of  (5.  1),  (5.  5), 
(5.  6)  with  w(- , 0)  = v(- , 0)  = 0,  we  find  that  (5.  17)  becomes 


<Cu,  (y( 


' , T)>  z(*,T))>  = (u,  z) 

L (riX[0,T])(=Y)  , 


which  gives  (C  now  being  defined  in  terms  of  ( , ) p) 


C (y( • , T),  z(- , T))  - Z 


(5.18) 


1 rx  x[o,  T] 


i.  e.  , C associates  with  the  terminal  state  ( y ( • , T),  z(-,T))  the  ob- 

dy 

servation  which  is  the  restriction  to  r X [0,  T]  of  z(x,  t)  = -^jr(x,  *)>  V 
being  the  solution  of  (5.  13),  (5.  14),  (5.  15)  with  the  indicated  terminal 
state  (y(-  , T),  z(*  f T) ) . The  domain  of  C consists  of  those  (y(* , T)  , 
z(-T))  for  which  z « l/(I'  x[0,T])  . (Note  that  the  "trace  theorem" 


2(rj)). ) 


([1],  b8])  only  gives  ze  ” ([0,  T];  H 

If  we  let  (w,  v)  be  (w,  v),  the  solution  of  ( 5.  1),  (5.5),  (5.6), 
(5.7),  (5.8)  with  m = 1,  u = 0,  we  find  that 

( S(WQ,  VQ);  (y(*  , T),  z(-,T))>E  = <(w0,VQ);  (y(.,0),  z(- , 0))>  £ 

so  that 


S (y(-,T),  z(-,T))  = (y(-,0),  z(-,0))  . 


(5.19) 


We  should  remark  here  that  if  rn  has  a non-empty  interior  the 
energy  norm  \fz E (cf.  (5.  16)  ff.  ) is  equivalent  to  the  induced  norm  in 
Z C H^(S7)  © H°(it)  and  Z,  with  this  norm,  is  a Hilbert  space  V(.  . 
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When  I'  is  empty,  the  important  case  wherein  (5.6)  applies  on  all  of 
r = dn,  then  the  energy  (5.16)  vanishes  for  w = constant,  v = 0 and 
\ 2E  is  only  a semi-norm.  If,  however,  we  identify  states  which  differ 
by  a state  (c,  0),  c constant,  then  V 2 E is  a norm  and  the  resulting 
Hilbert  space,  which  we  shall  still  call  V,  , is  equivalent  to  the  orthog- 
onal complement  of  (c,  0)  in  Z C H (U)  0 H^(f2)  . These  conventions 

will  be  tacitly  assumed  in  the  sequel.  In  any  event  the  adjoints  S , 

* 

C are  well  defined,  modulo  these  conventions,  and  if  they  are  con- 
fused with  the  adjoints  defined  relative  to  the  usual  inner  product  in 
1 0 

H ( !2)  ® H (Q)  little  damage  will  result. 

Approximate  controllability  of  (5.  1),  (5.5),  (5.6),  i.e.  of 
{X,  Y,  Z,  S,  Cj,  is  assured  if  we  establish  observability  of  {C  ,S  , Z,  Y,  X} 
In  the  current  context  this  means  we  should  show; 


Z|  =0  in  L2(T  X[0,  T]) 

r1  X[0,  Tj 

=>  S (y( • , T),  z( • , T))  = (y(*  , 0),  z(-,0))  = 0 in  . 


(5.20) 


If  this  is  true  then  (5.  1),  (5.  5),  (5.  6)  is  approximately  controllable  in 

the  sense  that  each  initial  state  (wQ , v ) can  be  steered  arbitrarily 

close,  in  the  energy  norm,  to  the  zero  state  at  time  T,  using  a control 
2 

in  L (Tj  X [0,  T])  . However,  since  this  system  is  time  reversible,  one 
readily  sees  that  this  is  equivalent  to  the  question  as  to  whether  one  can 
steer  from  any  state  (wf),  v^)  ( Z to  a state  arbitrarily  close  to  any  other 
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(w.,  v ) € Z with  respect  to  the  topology  of  V (=Z  with  the  E norm, 

I 1 L 

as  noted  earlier). 

Let  us  stop  and  consider  what  (5.  20)  means.  We  are  given  a solu- 
tion y = y(x,  t)  of  (5.  13),  (5.  14),  (5.  15).  Condition  (5.  15)  is 

(v(x),  A(x)^  (x,  t))  n = 0,  x e Tj,  0<t<T 
while  the  premise  of  (5.  20)  is 

z(x,t)  = |f(x,t)  = 0,  x « Tj,  0<t<T.  (5.21) 

If  we  differentiate  (5.  15),  formally,  with  respect  to  t,  we  have,  in  its 
place, 

(v(x),  A(x)||  (x,t))  n = 0,  x « I\,  0 < t < T . (5.22) 

dv 

Now  (5.  21),  (5.  22)  constitute  zero  Cauchy  data  for  the  solution  z = — 
of  (5.  13)  on  r,  X [0,  ] . The  question  is  then  seen  to  be  one  of  unique- 
ness; if  these  Cauchy  data  are  zero  is  it  true  that  the  solution  z is 
zero? 

The  fact  that  the  data  (5.  21),  (5.  22)  are  given  on  the  "time-like" 

(cf.  [11])  surface  x [0,  T]  does  not  permit  us  to  use  the  usual  existence 
and  regularity  results  for  hyperbolic  initial -boundary  value 

problems.  However,  our  problem  is  not  one  that  has  been  historically 
neglected  as  it  falls  into  a class  of  uniqueness  problems  studied  by 
Holmgren  [40],  John  [4  3]  and  others.  A strengthened  and  modernize 
version  of  these  uniqueness  theories  appears  in  Hormander's  b 
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To  describe  these  results  adequately  it  will  be  necessary  to  introduce 
the  concept  of  a characteristic  surface  for  (5.13).  Let  4>(x,  t)  be  a 
continuously  differentiable  function  of  (x,  t)  and  let  2 be  the  surface 


Z = <t>(x,  t)  = 0 . 

For  points  (x,  t)  e 2 consider  the  "characteristic  form" 

34>  2 34>  94> 

X = P(x)  (gf  ) - » A(x)  }En  » 


denoting  the  gradient  of  <b  with  respect  to  x . We  say  that  2 is 
characteristic  if  \ = 0 on  2 ; 

space-like  if  x > 0 on  2 ; 

time-like  if  x < 0 on  2 . 


Consider  now  the  set  K(T^,T)  defined  as  follows;  (x,  t)  t K(r^,  T)  if 
(x,  t)  lies  in  the  interior  of  a "lens-shaped"  region  whose  boundary  con- 
sists entirely  of  a portion  of  X [0,  T]  and  a portion  of  a second  (arbitrary) 
time  like  surface  2 . It  can  be  seen  that 

(a)  9K(  r^,  T)  - (I~j  X[0,T])  consists  of  characteristic  surfaces; 

(b)  K(T  , T)  is  symmetric  about  the  plane  t = T/2  ; 

(c)  K(Tj,  T)  H (n  © {T/2})  includes  all  points  (x,  T/2),  x«  n, 

for  which  the  distance  from  x to  T.,  relative  to  paths  in 

Mx)  i T 

£2,  is  less  than  min  ( ■ - ) ? — , where  X (x)  is  the 

xen  p(x)  2 1 

smallest  eigenvalue  of  A(x),  x « ft; 

(d)  K(Fj,  T)  does  not  include  any  points  (x,  T/2)  of  SIX  {T/2} 


for  which  the 

Vx)  - 

max  ( 777C  ) 2 


xt  a 

of  A(x) 


p(x) 


"i2-distance  from  x to  exceeds 

T/2,  where  \^(x)  is  the  largest  eigenvalue 


(e) 


KfTj.T) 


varies  continuously  with  respect  to  T (with  respect 


to  the  Haussdorff  metric,  e.  g.  ) 


Some  descriptive  sketches  are  given  in  [88]. 

From  the  above  we  can  infer  that  there  is  a least  time  Tj  for  which 

K(rr  T)  D (U  © {T/2 })  = V © {T/2},  T > T - 

Now  the  above  cited  Holmgren- John  uniqueness  theory  includes  the 
following  result 

Theorem  5.  1.  If  x is  a generalized  solution  of  (5.13)  satisfying  (5.21) 
and  (5.  22)  then 

Z = 0,  (x,  t)  € K(T p T)  . 

Consider  now  the  observation  (equivalently,  the  control)  problem 
for  a time  T > T , Tj  as  defined  above.  Choosing  e so  that  + 2e  = T 
we  can  see  that 


KfTj,  T)  D © (T/2  - e,  T/2  + e) 

and  from  this  it  is  possible  to  conclude  that 

z(x,  T/2)  = (x,  T/2)  = 0,  x € a . 

The  usual  uniqueness  results  then  show  that 

z(x,  t)  s 0,  x € n,  t « [0,  T]  . 
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In  any  event  we  have  seen  that  for  T > the  condition 


C*(y(-,T),  z(.,T))  = Z|  (=|f|  ) 

r^fo,  T]  Tj®[0,  T] 


= 0 


(5.24) 


implies 


0y 

y(x,  o)  = c,  ~ (x,  o)  = o,  x e n , 


where  c is  some  constant,  equal  to  zero  if  has  a non-empty  interior. 
Thus,  in  every  case  we  have 


. 
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ll(y(*,o),  z(.,o))||  = o , 

* E 


# % 

i.  e.  , the  implication  (5.  20)  is  valid.  Thus  {C  , S , Z,  Y,  X}  is  distinguish- 
able and  (X,  Y,  Z,  S,  C)  is  approximately  controllable.  When  has 
a non-empty  interior  the  latter  means  that  (5.1),  (5.  5),  (5.6)  is  approxi- 
mately controllable  in  the  space  X = Z with  the  H ( ft)  © H°(ft)  topology. 

When  rQ  is  empty  this  approximate  controllability  means  control  modulo 
3 w 

constant  states,  = 0,  w s c . Almost  certainly  a more  refined  analysis 
would  show,  in  fact,  that  if  T > Tj  the  constant  component  of  the  solution 
can  be  controlled  as  well  but  this  has  not  been  done,  to  the  authors 
knowledge,  except  for  the  case  where  p(x)  = 1,  A(x)  = I and  ft  is  a 
ball  in  En  ([33]). 

To  save  space  we  mention  only  briefly  the  cases  T = T^  and  T < T^  . 
For  T < Tj  we  have  neither  distinguishability  of  (5. 13),  (5.  14),  (5.  15) 
via  the  observation  (5. 18)  nor  approximate  controllability  of  (5.  1),  (5.  5), 
(5.6).  The  proof  appears  in  [88],  [89].  For  T = T^  the  situation  is 
rather  complex  as  the  mutual  geometry  of  and  ft  plays  a decisive 
role.  For  example,  if  ft  is  the  unit  ball  in  Rn,  p(x)  = 1,  A(x)  = I and 
Tj  = 3ft  (rQ  empty)  we  do  have  approximate  controllability  for  T = T^  . 

But  if  ft  is  the  rectangle  |xX|<_l,  i = 1,  2,  . . . , n and  consists  of 
two  faces,  e.  g.  the  faces  x^  = ± 1,  we  do  not  have  approximate  con- 
trollability for  T = Tj  . The  mathematical  discussion  appears  in  [89]. 

To  summarize,  we  have 
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Theorem  5.  2.  There  is  a T^  > 0 such  that  the  system  (5.  1),  (5.5), 
(5.6)  is  approximately  controllable  in  y^  (Z  with  the  E topology) 
using  controls  u « L2(Tj  * [®>  T]),  provided  T > T^  . Jf  T < T^  the 
system  is  not  approximately  controllable.  If_  T = Tj  approximate  con- 
trollability  may  or  may  not  obtain,  depending  on  the  mutual  geometry  of 


Q and  Tj  (details  in  [89]). 

The  parallel  distinguishability  result  for  (5.13),  (5.14),  (5.15)  with 
the  observation  (5.  18)  is  the  validity  of  (5.  20)  for  T > T^,  etc. 

The  above  approximate  controllability  result  has  been  applied  to 
resolve  certain  stabilizability  problems.  We  will  discuss  this  further 
in  Section  6 . Nevertheless  it  is  a rather  weak  result  with  meagre  con- 
sequences - as  one  should  expect  from  the  ease  with  which  it  is  proved. 
Exact  controllability  results  come  much  harder  but  are  worth  a lot  more, 
mathematically,  once  one  has  obtained  them.  They  are  our  next  order 
of  business. 

We  consider  again  the  system  (5.  1),  (5.  5),  (5.  6)  but  we  restrict 
attention  to  the  case  p(x),  A(x)  constant.  It  is  then  easy  to  see,  via 
a change  of  variables  ([93]),  that  we  may  as  well  treat  only 

a2  n a2 

3w  _ - i»  =0  ,5.25) 

at  i=i  (ax  ) 

the  traditional  " wave  equation".  The  boundary  conditions  become 

w(x,  t)  = 0,  x € r t > 0 (5.  26) 


j^(X,t)=U(X,t),  X € rv  t>0 
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where  - — denotes  the  directional  derivative  along  the  normal  direction 
9v 

v to  the  boundary  r of  ft  . 

We  shall  require  the  concept  of  a "star-shaped"  region  in  what 

# n * # 

follows.  A region  ft  C R is  star  shaped  if  there  is  a point  x « ft 

3fc 

such  that  for  every  x € ft  the  line  segment  joining  x and  x also  lies 
in  ft  . A further  notion  which  we  shall  use  is  that  of  a "star-comple- 
mented" pair.  We  consider  the  region  ft  C Rn  with  boundary  9ft  = r 
and  we  let  denote  a relatively  open  subset  of  r . We  say  that  the 
pair  (S2,  Tj)  is  star-complemented  if  there  exists  a star  shaped 

# sje  # 

region  ft  with  boundary  9 ft  = r such  that 

c 

* — c 

Si  C ( closure,  complement) 

r = r - r ' , i.  e.  r - c r ' . 

(See  the  geometric  configuration  displayed  in  Fig.  5.  1.  ) An  equivalent 
statement  would  be  to  say  that  ( S2,  r^)  is  star  complemented  if  there 
is  a point  x <J  n with  the  property  that  each  point  x « rQ  = r - 
can  be  connected  to  x by  a line  segment  which,  except  for  the  point 
x itself,  lies  entirely  outside  Q . 

It  is  also  necessary  to  assume  certain  "regularity"  conditions 
concerning  the  manner  in  which  meets  £2  . These  conditions  permit 

one  to  employ  the  "reflection"  methods  described  in  [58]  in  order  to  carry 
out  the  extension  procedure  to  be  described  in  the  proof  of  the  next 
theorem.  It  would  carry  us  too  far  astray  to  detail  these  assumptions 
here.  The  interested  reader  is  referred  to  [93]. 
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I 

J 


Figure  5.  1.  The  domains  £2,  £2  , £2 

e 

The  main  (exact)  controllability  result  is 
Theorem  5.3.  Consider  the  system  (5.25),  (5.26),  (5.  27)  in  an  open, 
bounded  connected  region  £2  C Rn  with  boundary  r having  as  a 
relatively  open  subset.  If  the  pair  £2,  is  star-complemented  then 
this  system  is  exactly  controllable  in  the  space  (with  induced  Hilbert 
space  structure) 

Vq(O)  = f(w,  v(=^))  € H2(£2)  ©H!(£2)|  w(x)  = 

v(x)  = 0,  x « r - r2  = rQ  } (5.  28) 

in  the  sense  that  there  is  a positive  number  T^  such  that  if  T > T^  , 

2 

and  one  is  given  arbitrary  states  (w0>  vq)>  (wt>  vt)  « then  there  is 

2 

a control  u « L (r  X [0,  T)  such  that  the  solution  of  (5.  25),  (5.  26), 
(5.27)  and 
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w(-  , 0)  = w0»  v(-  , 0)  = t^(- , 0)  = VQ  (5.  29) 

is  steered,  by  the  action  of  the  control  u,  to  the  terminal  state 

Ow 

w( • , T)  = wT,  v( • , T)  = — (•  , T)  = vT  . (5.  30) 

2 2 1 

The  appearance  of  the  space  V Q(  ft)  C H (ft)  © H (ft)  here  is  somewhat 
curious  and  immensely  inconvenient.  We  will  see  in  a later  section  that 
it  is  rather  important  to  be  able  to  control  all  finite  energy  states,  i.  e.  , 
all  states  (w,  v)  « Vg(ft)  C H*(ft)  © H°(ft)  for  which  w(x)  = 0,  x € . 

We  will  comment  on  this  further  after  we  sketch  the  proof. 

Proof  of  Theorem  5.  3 (Sketch).  The  method  used  here  is  a very  simple 
one  - completely  "organic"  in  a sense,  as  the  idea  is  to  remove  artificial 
barriers,  let  "nature  take  its  course"  and  then  emulate  nature's  methods. 

It  is  similar  in  form  to  the  proof  ([93])  of  Theorem  2.  10. 

For  some  positive  e > 0 let 

ft  = {x  « Rn  | x <j  ft* , 1|  x - x ||  < e 

e 

* . 

for  some  x c 12},  (see  Fig.  5.1), 

sj< 

that  is,  £2  is  the  " e -neighborhood  of  ft  outside  ft  " . Let  (cf.  (5.28)) 
e 

V2(Rn  - ft*)  = {(w,  v)  s H2(Rn  - ft*)  © H1(Rn  - ft ‘ ) | 
e 

:]<  sj< 

w(x)  = v(x)  = 0,  x € r = Oft  , and 
w(x)  = v(x)  = 0,  x <j  ft^}  . 

Now  it  can  be  shown  (see  [58],  [93])  that  there  is  a bounded  linear  trans- 
formation 
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2 2 n * 

F:  V (ft)  — V (R  - ft  ) 

0 C 

A A 2 

with  the  following  properties  for  (w,  v)  e VQ(ft): 

a a >se  '-e  2 n 

F(w  f v)  s (w  , v ) c V£(R  - 52  ) I 

we(x)  = w(x),  vf  (x)  = v(x),  x £ ft  ; 


(5.  31) 


1,^6 
l(W  , 


A 6 
V 


I <K  lJ(w,  v) 

v (R  -ft  ) 
e 


2 

V") 


= F 


' 2 
vo(n) 


Thus  F extends  each  (w  , v )eV^(ft)  to  a pair  of  functions  (we,v6) 

n * 

with  comparable  smoothness  in  R -ft  with  the  additional  property  that 
g /v  e 

w , v vanish  outside  ft  . 

z 

2. 

Now  let  (w  vQ)  be  an  arbitrary  state  in  VQ(ft)  and  consider  the 
equation  (5.2  5)  in  the  region  Rn  - ft  with  initial  state  being  the  above 
described  extension: 


(w(- , 0),  v(*  , 0))  = F(wq,  vq)  s (w  ®,  v®)  . (5.32) 

A 

Following  the  well-established  theory  for  solutions  of  (5.  25),  let  w be 

n * 

the  solution  of  (5.  32)  for  x e R - ft  , t >_  0,  with  these  initial  data 
and 

w(x,  t)  = 0,  x € ft  , t > 0 . (5.  33) 

a.  n * 

The  behavior  of  the  solution  w in  the  region  R - ft  exterior  to 

sje 

the  star- shaped  region  ft  has  been  the  object  of  a great  deal  of  study 
by  Lax,  Phillips  [53],  [54],  Morawetz  [7  0],  [71],  [72],  [53],  Strauss  [106], 
Ralston  [80],  [81]  and  others.  It  is  still  a field  of  considerable  interest 
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with  many  questions  unresolved  to  the  present  day.  Nevertheless,  from 
the  existing  results  we  know  the  following.  If,  as  we  have  assume  :,  the 
support  of  the  initial  state  (5.  32)  lies  in  a bounded  region,  such  as  our 

r ^ 

region  U , and  it  we  let  w denote  the  restriction  of  the  solution  w 
e’ 

of  (5.  2 5),  (5.  32),  ( 5.  33)  to  another  bounded  region,  such  as  our  U , 
then  there  exist  positive  numbers  M,  y such  that  for  all  t ^ 0 


(wf(-  , t),  vV  , t)ll  2 n — 

K (R  - Q 
z 


(5.  34) 


(The  results  cited  above  actually  establish  (5.  34)  only  with  respect  to 
the  energy  (equivalently,  h\u)  © H°(S2))  topology  but  the  extension  to 
(5.  34)  for  solutions  initiating  in  V“(f2)  is  immediate.)  The  basic  idea 
is  that  all  solutions  whose  supports  initially  lie  in  a bounded  region, 
uniformly  dissipate  their  energy  "outward"  into  space  away  from  the 
star- shaped  region  q"  . The  theory  has  recently  been  extended,  at 
least  in  R2,  to  regions  Q which  are  not  necessarily  star-shaped  but 
have  the  property  of  not  "trapping  rays"  [72],  [106].  An  extension  to 
hyperbolic  systems  has  also  been  obtained  [81]. 

Consider  again  the  solution  w of  (5.  25),  (5.32),  (5.  33)  for 
x « Rn  - Q , t > 0,  but  with  reference  to  its  restriction  to  the  surface 

r x[0,  <«)  which,  by  the  star-complementation  conditions,  lies  in 

* ~~~~~  & w 

Rn  . ^ . More  precisely,  consider  the  behavior  of  there.  From 

the  properties  of  w and  the  general  trace  theorem  [1],  [58]  we  know 
that,  for  any  fixed  T > 0 , 
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rx  xfo,  T] 


« C (fo,  T] ; H^Tj)) 


C L (T.  X[0,  T]) 


and  further,  for  some  B > 0 , 


“lfr“  2 i bIIc%>  %)»  2 „ — 1 bII rll  llw  , 0 II 

l (r  Xfo,  Tj)  0 0 vV-S!  i 0 0 vha, 

1 z 0 

Having  set  all  of  this  down  we  are  ready  to  proceed.  We  choose 
T large  enough  so  that  (cf.  (5.  31),  (5.  34)) 


M e" vT  < 1 . 


Then  we  take  (w  q , v q)  € VQ(f2)  and 

(a)  extend  to  (w'n,  vj  « V^(Rn  - i2 " ) via  F ; 

U U £ 

(b)  construct  the  solution  w of  (5.25),  (5.32),  (5.  3 3)  with 
(w  (•  , 0),  v(-  , 0))  = w g,  v ^ as  the  initial  state  ; 

x dw 

(c)  set  u(x,  t)  =— (X,  t),  x € r , t€  [0,  T]  . 

^ r 

/n  /v  r r 0 w 

We  note  then  that  u,  w and  v (=  ) jointly  satisfy  ( 5.  2 5),  (5.29), 

(5.  26),  (5.27)  and,  by  the  usual  uniqueness  theorems,  constitute  the 
only  solution  of  that  mixed  initial  initial -boundary  value  problem.  We 
note  that  (5.  35)  gives 

llw  (•  , T),  vr(-,T))||  2 <Me‘YT||(w®,  v®)||  — 

V («)  V (Rn-fi"'  ) 

u e 


< IIfIIm  e-^T  ll(w0,0o)ll  2 


(5.  36) 


vo(n) 
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Now  let 


(w  j,  v = -(wr(- , T),  vr(-,T))  , (5.37) 

(w  j , vj ) = F(w  j,  v ^ . (5.  38) 

that  is,  we  extend  the  negative  of  the  terminal  state  obtained  by  the 

~ e ~ e ^ e ~ e 

process  (a),  (b),  (c)  above  to  a state  (w^,  v ) which,  like  (w^,  vq)  > 

lies  in  v2(Rn  -£?').  Then  we  run  the  whole  "(a),  (b),  (c)"  process 
e 

in  reverse  from  t = T to  t = 0 . We  let  w be  the  solution  of  (5.25), 

~ ~ n * 

(5.  33)  with  terminal  state  (w,v)  at  time  T in  R - ii  , t < T . We 

~ 3 w ~ £ 

let  u (x,  t)  = (x,  t),  x e Tp  0 < t < T and  again  have  u « L (Tj  X [0,  T]) 

Then  u , w r,  v r,  the  last  two  being  the  restriction  of  w,  v to  x e Q , 

jointly  satisfy  (5.25),  (5.26),  (5.27).  We  let 

(w0,  vQ)  = (wr(-  , 0),  vr(-  , 0))  . 


Now,  since  (5.25),  (5.  33)  are  invariant  under  time  reversal,  the  decay 

~ A 

properties  of  w backward  in  time  are  the  same  as  those  of  w forward 
in  time.  Thus  we  now  have  using  (5.  35),  (5.  36), 


II  F ||  M e' Vr||(wr(-,T), 


^ r 
v 


(•  , T))  II 


< II  F ||  ^ 


.,2  -2yT 
M e 


0 2 

Vn) 


(5.  39) 


Let 


w(x,t)  = wr(x,  t)  + wr(x,  t),  x e n,  0 < t < T . 

From  the  material  above  we  see  that  w satisfies  (5.  25),  (5.  26)  and 
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A A 


(w(-  , 0),  v(-  , 0))  = (wQ,  vQ)  + (wQ,  VQ) 


bw  8 w 3 w ^ ~ 

— (X,  t)  = -T—  (X,  t)  + — (X,  t)  = U (X,  t)  + U (X,  t)  2 U(X,  t) 
ov  civ  dv 


(5.40) 


x*  r,  o < t < t , 


and,  because  of  (5.  37), 

w(x,  T)  s v(x,  T)  s 0,  x e U 


In  other  words,  the  control  u = u + u steers  the  solution  w of  (5.25), 
(5.26),  (5.  27)  from  the  initial  state  (5.  40)  to  zero.  Thus,  if  (5.40)  can 
be  an  arbitrary  state  in  V^(Q)  we  have  exact  null  controllability  in 
Vq(£2),  that  is,  we  can  steer  any  point  in  V ^(t2)  to  zero  in  time  T . 
Now  this,  in  fact,  is  the  situation.  For  (w  , v ) is  obtained  from 
(w  , Vg)  by  a linear  map,  call  it  K . The  requirement  that  (5.29)  and 


(5.40)  should  agree  is  then 

(I  * K)  (w0>  0o)  = (wQ,  v0)  . 

Since  (5.  39)  shows  that  ||k||  < 1,  (5.41)  is  uniquely  solvable  for 


(5.41) 


(w  , V ) « ^0<n)  and  (5‘29)  may  thus  be  satisfied  by  appropriate  choice 


A A, 


Of  <w0,  v0)  . 

To  complete  our  proof  we  merely  note  that  the  invariance  of  (5.2  5), 
(5.26),  (5.27)  under  time  reversal  shows  the  problem  of  controlling  from 
a zero  initial  state  to  an  arbitrary  terminal  state  (w  , Vj)  to  be  equivalent 
to  the  null  controllability  problem  solved  above.  Then  simply  adding  the 
two  solutions  we  solve  the  problem  as  originally  posed  and  the  proof  is 
complete. 
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We  can  now  complete  the  remarks  made  following  the  statement  ' f 

our  theorem.  Our  restriction  to  states  in  H^(£2)  1 j.  • H*(£2)  is  made  necessary 

by  the  very  nature  of  the  trace  theorem,  of  which  we  clearly  make  essential 

^ ~ 2 n — • 

use.  For  such  states  the  solutions  w,  w lie  in  C([0,  T] ; Hf^(R  -£2  )) 

o  w o w 

and  the  truce  th'-orem  gives  - — , — , and  consequently  the  control  u , 

Ov  Ov 

i 

as  elements  of  C ([0,  T]  ; H ?(  Tj  x [0,  T])  . If  we  use  the  " finite  energy" 

1 0 - A 

space  H ( £2)  © H (£2)  we  get  u e C ([0,  T];  H ?(Tj))  by  the  same  process 

and  u is  a distribution  of  a particular  type  rather  than  a square  integrable 
function.  This  might  appear  to  be  merely  a matter  of  mathematical  nit- 
picking but,  in  fact,  it  is  not.  We  will  see  why  in  Section  6. 

We  actually  conjecture  that  Theorem  5.  2 is  true  with  H^(£2)  © H°(£2) 

2  1 

used  instead  of  H (£2)  © H (£2),  provided,  of  course,  that  the  star- 
complementation  condition  remains  in  force.  We  also  conjecture  thar  in 
general  it  is  not  true  with  either  of  these  spaces  when  the  star  comple- 
mentation condition  fails.  Proof,  or  disproof,  of  these  conjectures  is, 
arguably,  one  of  the  most  significant  currently  unsolved  problems  in  the 
theory  of  control  of  partial  differential  equations.  However,  despite  the 
absence  of  complete  proofs,  we  are  not  without  persuasive  support  for 
the  validity  of  our  conjectures.  This  also  will  be  made  clear  in  Section  6. 

We  conclude  this  section  by  noting  (at  least)  a recent  development 
which  seems  to  shed  some  light  on  the  method  of  proof  which  we  have 
used  in  Theorem  5.  3.  Recent  work  due  to  Baras,  Brockett,  Helton  ([3], 

[4  ],  [35],  [36],  which  we  do  not  review  here  because  it  is  primarily 
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related  to  the  algebraic  theories  of  representation  of  control  systems, 
shows  that  there  is  a deep  underlying  relationship  between  the  scattering 
theory  of  Lax,  Phillips  ([54])  and  others  and  control  theory.  This  makes 
the  use  of  scattering  theory  results  in  the  proof  of  Theorem  5.  3 seem 
much  less  "accidental"  than  otherwise  would  be  the  case. 


6.  ENERGY  DECAY  TOR  THE  WAVE  EQUATION  WITH  BOUNDARY  DAMPING 


Our  intention  in  this  section  is  to  develop  results  comparable  to 
Theorem  3.  9,  but  now  in  the  context  of  the  wave  equation 


a2  n 
d w y 


p(x)  2 

3t  i,  j = l 9x 


9 i,  . 9w 
— j Q-.(X)  ■ ) = 0 

1 > 9x 


(6.1) 


for  t > 0 and  x c 12,  a domain  in  R as  described  in  the  previous  section. 
Again  we  suppose  r = 912  divided  into  two  regions:  r = r ll  r , with 


r relatively  open  and  non-empty.  We  suppose 

w(x,  t)  = 0,  x € rQ,  t > 0 


9w 


and,  for  (3  > 0,  we  impose  the  condition  (v(x,  t)  s (x,  t)) 
9w 

v(  x,  t)  + (3(—  (x,  t),  A(x)  v(x))  n = 0,  x*  rlf  t >0 


(6.2) 


(6.  3) 


where,  as  before,  v denotes  the  exterior  normal  direction.  When  12  is 
an  interval  in  R*,  p=  A = 1,  and  Fj  is  one  endpoint,  say  x = 1,  (6.  3) 
becomes 

3 w 9w 

which  after  transformation  (see  (4.  5) f f.  ) takes  a form  included  under 
(3.  64)  of  Assumption  3.  8 . In  particular,  this  is  a dissipative  boundary 
condition;  supposing  an  initial  state  (cf.  (5.28)) 


3w 


(w(-,0),  v(-  , 0)  (=  -g^-(*  , 0)))  = (W  ,vQ)  € V0(12) 


(6.4) 


the  resulting  solution  w = w(x,  t)  of  (6.  1),  (6.  2),  (6.  3)  lies  in  C [0,  *)  ; 
V^(12))  and  we  may  differentiate  the  energy  expression  (cf.  (5.16)): 
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~ E(w(-,t),v(-,t)  f [ p(x)(v(x,t))Z  + (^(x,  t),  A(x)^(x,  t))  n ] dx 


2 2 

w 9 w(x.  t)  3w 

p(x)v(x,  t) 7 (*>  t)  t ( ■ Tjj;  ' , A(x)  — (x,  t))  dx 

"n  at  dtdx  ax  e 


= (after  use  of  the  divergence  theorem  and  (6.  1)) 

= f fv(x,t)  t),  A(x)  v(x))  Ids  = (using  (6.  2),  (6.3)) 

r E 


f (v(x,  t))Z  ds  < 0 


(6.  5) 


We  have,  then,  a situation  much  like  that  of  Theorem  3.  9.  The 
energy  is  non-increasing  but  it  is  entirely  possible  that  v(x,  t)  = 0 , 
x € Tj,  for  t in  some  interval  of  finite  length,  foreboding  some  difficulty 
in  an  attempt  to  prove  that 


lim  E(w(- , t),  v( • , t))  = 0 

t — oo 


(6.  6) 


Actually,  (6.6)  can  be  proved  under  no  more  assumptions  than  we 
have  already  made.  We  make  direct  use  of  the  extended  invariance 
principle  [34]  as  enunciated  by  Hale  together  with  Theorem  5.  1 of  the 
previous  section.  As  at  the  beginning  of  Section  5,  we  assume 
non-empty  and  relatively  open  in  9q  . 

Theorem  6.  1,  Under  the  above  assumptions  on  the  system  (6.1),  (6.2), 
(6.  3)  we  have  (6.  6)  for  each  generalized  solution  w = w(x,  t)  corre- 
sponding to  an  initial  state 
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(w(.,0),  v(-,0))  - (wQ,  vq)  € VQ(n)  (=  ^w>  v) 

£ H1  (ST2)  © H°(S2),  w(x)  = 0,  xe  r . Called  Z in  ( 5.  9) . ) (6.7) 

Sketch  of  Proof.  Though  we  want  to  establish  (6.  6)  for  initial  states 
(6.4)  in  Vq(S2),  we  shall  first  consider  a solution  w = w(x,  t)  corre- 


sponding to  an  initial  state  (cf.  (5.2  8)) 

(w  ('  , 0),  v ( • 0))  = (w0,v0)  e V^(J2) 


(6.8) 


Then  (w(-,t),  v(-,t))e  C ([0,  *>);  V ( S2))  and,  carrying  out  (6.  5),  we 
have  E(w(-,t),  v(-,t))  non-increasing  and  hence 


lim  E(w(-,t),  v(.,t))  = E^  > 0 . 
t -* 


(6.  9) 


This  next  step  is  to  show  that  (w  (•  , t),  v (•  , t))  remains  in  a 
2 

bounded  subset  of  V (S2)  . This  is  done  (see  [79])  by  establishing  that 

8w  9 w ~ 9 v . . 

E(— (-,t),  — ( - , t))  = E(v  (•  , t),  — (-,t)) 


at 


= i / T,  -V  (.;<*)  + (^(x,t),A(x)^x(x,t))  }dx 

n J i,  j=l  9x  E 


9w 

dx 


is  non-increasing.  Now  it  is  known  ([1])  that  bounded  subsets  of  Vq(^) 
are  pre-compact  with  respect  to  the  topology  (cf.  (6.7))  of  > 

which,  as  we  have  noted  earlier,  is  derived  from  a norm  whose  square 
is  essentially  equivalent  to  the  energy,  E . From  this  one  can  establish 


that 


lim  || w (•  ,tk),  v(- , tfc)  - (wQ,  vQ)  II  1 
k -►  oo 


= 0 


vQ(n) 


-ill- 


1 


A 2 

for  some  sequence  {t^}  with  lim  t^  = <*  and  for  some  (wq>vq)  £ VQ(S1) 

oc 

2 

(since  the  weak  limit  with  respect  to  the  V JT>)  topology  must  agree 
with  the  strong  limit  in  the  V^fQ)  topology).  One  then  shows  that  the 
solution  w = w(x,  t)  of  (6.1),  (6.2),  (6.  3)  with 

(w(.  , 0),  v(- , 0))  = (wQ,  vQ) 

has 

E(w(-  , t),  v(-,t))2EB  (cf.  (6.9)),  t > 0 (6.10) 

v( x,  t)  S o,  x £ Tj,  t > 0 . (6.  11) 

But  then  Theorem  5.  1 gives  v(x,  t)  = 0 whence  = 0 and  (6.  9)  gives 

lim  E(w(*  , t),  v (- , t))  = 0 . 

t — OC 

Approximating  the  initial  state  wgvQ  in  *7^(17)  by  states  (wq>vq)  *n 
V^(12)  one  readily  obtains  (6.6)  for  initial  states  in  Vg(ft)  and  the  proof 
is  complete. 

It  is  clear  that  the  scheme  applied  above  is  just  that  laid  down  in 

[34].  An  earlier  treatment  than  [79],  the  paper  [104]  by  Slemrod,  used 

similar  methods  to  establish  that 

lim  (w(-  , t),  v(-  , t)  = 0 
t-*  oo 

in  the  weak  topology  of  V^fi),  making  use  of  the  weak  pre -compactness 
of  bounded  subsets  of  that  space. 

Now  the  question  arises:  what  more  can  we  prove  when  the 
stronger  controllability  result  of  Theorem  5.  3 obtains;  i.  e.  , when  we 
consider  the  equation  (5.  25)  ( p(x)  a 1,  A(x)  = I)  and  add  the  assumption 
that  the  pair  (i2, r ) is  star  complemented ? In  this  case  our  system  is 
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„2 
d w 

atz 

w(x,  t)  = 0,  x e Tp  t _>  0 . (6.  13) 

|p(x,  t)  + p (x,  t)  = 0,  x e Tj,  t > 0 . (6.14) 

It  should  be  noted  that  one  could  obtain  the  above  as  a "closed 
loop  system"  realized  via  use  of  a linear  feedback  law 

u(x>t)  = ' W (x’t}’  x e ri»  1 - 0 

substituted  in  the  control  system  (5.  25),  (5.  26),  (5.  27)  or  one  could 

envision  this  system  arising  naturally  as  the  result  of  some  dissipative 

mechanism,  such  as  friction,  acting  on  the  portion  of  the  boundary. 

For  n = 1 Theorem  3.9  applies  to  show  that  the  energy  decays 

exponentially,  corresponding  to  the  fact  that  finite  energy  states  can  be 

2 

controlled  with  controls  u e L [0,  TJ  . In  the  present  section  we  are 
primarily  concerned  with  the  "higher  dimensional"  cases  n > 2,  where, 
as  we  shall  see,  the  situation  is  decidedly  more  complicated. 

If  one  takes  further  care  in  Section  5 one  can  show,  in  the  context 
of  the  exact  controllability  results  presented  there,  that  for  the  control 
problem  (5.  25),  (5.26),  (5.  27),  (5.29),  (5.  30)  we  have 

Hull  < K(||(w  , v )||2  + II  (w  v )|| 2 ) (6.15) 

l (r1  x [o,  t])  v0(n)  v0(«) 

for  some  positive  constant  K . It  is  also  possible  to  see,  and  this  will 
prove  important  in  the  sequel,  that  there  is  a constant  K > 0 such  that 


n 2 
V « w 

u i 2 
i=l  (9x  ) 


= 0,  x c S2,  t > 0 


(6.  12) 
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~\ll,  < K(ll(w„,v  |||2  * II  (w-.v,)!!2  , 

,Jt  t'lf,  K[0,T]|-  0 ° *2<Si)  1 1 V2<«) 


) (6.  16) 


The  fact  that  we  can  prove  the  controllability  only  for  states  in 
V^(S2),  rather  than  in  VQ(£2)  , has  a definite  weakening  effect  on  what 
we  can  prove  in  regard  to  the  decay  of  energy  of  solutions  of  (6.  12), 

(6. 13),  (6.  14). 

Theorem  6.  2 . Let  the  pair  (S2,  r^)  be  star-complemented.  Then  every 
solution  w = w(x,  t)  of  (6.  12),  (6.  13),  (6.  14)  satisfying  an  initial  condition 
(w(-  , 0),  v(-  , 0))  = (wQ,  vQ)  c vJ(B)  (6.  17) 

has  the  property  that  the  energy 

E(w(-,t).  v(*,t))  = j f [(v(x,  t))  + v ( j-(x,  t))  ]dx 

Q k=l  9x 


satisfies  an  inequality 

c(w  v ) 

E(W(- ,t),  v(-  ,t))  < — YTt ’ t-°  (6-18) 

where  C(w  , v ) depends,  in  general,  on  ||(w  , v )|| 

%(n) 

Sketch  of  Proof.  We  proceed  with  a view  to  treating  the  system  (6.  12), 
(6.  13),  (6.  14)  in  much  the  same  way  as  we  did  the  corresponding  hyper- 
bolic system  in  Theorem  3.9. 

Let  us  suppose  that  we  have  a solution  w(x,  t)  of  (6.  12),  (6.  13), 
(6.14)  corresponding  to  an  initial  state  (6.17).  The  solution  w then 
lies  in  d(fo,oo);  v^(Sl))  H n *([0,  =r);  (see  [58],  [59])  and  one 

may  differentiate,  as  in  (6.  5),  to  obtain 
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(6.  18) 


d 

dt 


E(w(- , t),  v(- , t))  = - ■-  J (v(x,  t))Zds  . 


Letting  T >T  , the  positive  number  introduced  earlier  in  Theorem  5.  5 
we  have 


E(w  v ) - E(w(.fT),  v("  , T) ) - T / f (V(x,  t))Z  ds  dt  . (6.  i9) 


o r 


We  now  return  to  the  control  problem  which  we  treated  in  Theorem  5.  3, 
specialized  to  a zero  initial  condition  and  a particular  terminal  condition 


defined  in  terms  of  the  solution  w described  above; 

rZ  n 2 

ay  v1  o v 

— j-  - ) — = °,  x « n,  t > o 

3t  k=l  (3xk) 


(6.  20) 


3y 


y(x,  t)  = o,  x € rQ,  t > 0,  — (x,  t)  = u(x,  t),  x € Tj,  t > 0 (6.  21) 


3y 


dy 


9w, 


y(x,  0)  = 37  (X,  0)  0,  y(x,  T)  = w(x,  T),  ~~~  (Xj  T)  = -^x,  T)  , 


at 


at 

X € 12 


(6. 22) 


We  have  seen  that  we  can  find  a solution  y(x,  t),  x € 12,  0 < t < T,  for 
this  problem.  A short  computation  similar  to  (6.  5)  then  shows  that 

2 E(w(-  , T),  v(-  , T))  = / (~(x,  T)  |f(x,  T)  + V — k(x,  T)  ^ (x,  T))dx 


12 


k=i  ax 


3x 


/3y  ow  vn  tfv  dw 

(— (x,0)  — -(x,  0)  + Yj  -t-(x,  0)— j-(x,  0))dx 

12  k = l 3x 


Bw 


3w 

I? 


r r 3w  3v  3w 

= J J (-gT-(x,  t)  u(x,  t)  + -i-(x,  t)  — (x,  t))ds  dt  = (cf.  (6.  14)) 


0 r. 


at 


0 V 


= f f ^ It  <x>  t)lds  dt  • 


o r. 


p at 


(6.  23) 
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Applying  th 


r r 9w  1 

J J (TT(x-t))  ds  dt  _ 


■-  ■' -<w(-  , T),  v(-  , T)) 


o r, 


f f (u(x,  t)  - f ff  (X,  t))Z  ds  dt 

‘or,  p 9t 


E(w(-  , T),  v(-  , T))' 


llu||zz  m^)zII^IIz2 

L (Tj  X[0,  T])  ' L (Tj  X [0,  T]) 

> (cf.  (6.  15),  (6.  16),  (6.  22)) 


il(w(-  ,T),  v(.  , pr 


m||(w(-  , T),  v(-  , T) ) || 


(6.24) 


for  some  M > 0,  independent  of  the  particular  solution  w . Consequently 
we  now  have,  from  (6.  19), 

E(w  v ) - E(w(-  , T),  v(-  , T))  > 2 • (6.  25) 

° ° M||(w(-,T),  v(*,T))||  2 

vQ(n) 

The  same  sort  of  argument  applied  on  successive  intervals  [kT,  f k +-1)T] 
shows  that 

E(w(-,kT),  v(*  , kT))  - E(w(-,(k+l)T),  v(- , (k+l)T)) 


E(w(~  , (kt-l)T),  VC  , (kH)T)f 
M||(w(- , (ktl)T),  v(-  , (k+l)T))  II  2 


(6. 26) 


V0(f2) 
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Now,  as  noted  in  Theorem  6.  1,  a slightly  more  careful  study  of  (6.  12), 
(6.  13),  (6.  14)  (see  f 79  ] for  details)  enables  one  to  see  that 


(w(-,t),  v(-,t))||  < B||(w  v )||  , t > 0 . 


V0(a) 


Letting  £ = E(w( • , kT),  v( • , kT)),  k = 0,  1,  2,  . . . , (6.  25),  (6.  26)  and 


(6.27)  combine  to  show  that 


Ek-5=ti-=(w0'V  W k = 


(6.  28) 


/here  c is  a positive  constant  depending  on  the  initial  state  (w^,  v^) 


e V^fQ)  . Thus  the  e satisfy  a quadratic  nonlinear  recursion  equation. 
0 K 


From  (6.  28)  one  can  establish  ([79])  that,  for  some  c(w  , v ) > 0 , 


c(w0,vo) 


Ek-  k+1 


k = 0,1,  2,  ... 


and,  making  use  of  the  fact  that  E(w(-,t),  v(*,t))  is  non-increasing, 


we  have,  finally 


C(W0’V0) 


E(w(  • , t),  v(  • , t))  < —t , t>0  , 


where  C(w  , v ) can  be  bounded  in  terms  of  H(W0>V0)H  2 ' ThiS  iS 


just  (6.18)  and  the  theorem  is  proved. 


Vq(S2) 


We  remark  that  if  C(w  , vQ)  could  be  bounded  in  terms  of 


(w  , v0)ll  1 (which  is  equivalent  to  the  norm  defined  by  E(wQ,  vQ))  , 


V («) 

(6.  18)  would  necessarily  imply  uniform  exponential  decay  of  energy  as 


a result  of  the  semigroup  property  of  the  solutions  of  (6.  12),  (6.  13), 


(6.  14).  But  this  has  not  been  shown  to  be  the  case  and  we  must,  for 


the  moment,  be  content  with  the  result  we  have. 


I 


As  we  have  already  note  i ir,  Section  5,  we  conje  ture  that,  pro- 
vided the  pair  (ft,  r ) is  star-  c implemented,  Theorem  6.  3 can  be 
strengthened  to  give  controllability  in  '<•  (ft)  th  result  (A.  18)  can 
be  strengthened  to  the  extent  • d iem  nstrnting  uniform  exponential  decay 
of  energy.  Curiously  enough,  however,  th'  impi  /ement  f (6.18)  s 
likely  to  be  established  first  and  then  the  strengthened  version  of 
The  rem  6.  3,  following  a method  similar  to  the  proof  in  Section  5 but 

using  (6.  12),  (6.13),  (6.11)  instead  of  the  wave  equation  in  the  "external" 

„ n * 
region  f?  - ft 

In  [33]  K.  D.  Graham  and  the  present  author  have  studied  the  con- 
trol system  (6.20),  (6.21)  for  the  special  case  where  ft  is  the  unit  ball 
in  E1',  Tj  = F = 9ft,  rQ  = 0 . The  method  relies  on  separation  of 
variables  and  reduction  of  the  control  problem  to  an  infinite  collection 
of  moment  problems  similar  to  (4.  16),  but  involving,  in  place  of  the  <r,  , 

th*1  various  zeros  w,  . of  the  Bessel  functions  of  order  j . We  have 

J 

been  iblo  to  establish  in  this  special  case  that  all  states  (w,  v)  e V \ft) 

( H (ft)  • H(  (ft))  can  be  controlled  with  controls  u e L (9ft  X [0,  2 + e]), 
e 0 . Curiously  enough,  however,  the  result  does  not  lead  to  a proof 
of  exponential  decay  for  solutions  of  (6.  12),  (6.  13),  (6.  14)  in  the  cor- 
responding situation.  The  pr  .bh  m is  that  it  does  not  seem  to  be  possible, 

2 

via  this  method,  to  establish  the  inequality  (6.16)  with  V (ft)  replaced 
by  V^(ft)  (though  we  do  have  (6.  16)  with  V^(ft)  replaced  by  v\ft))  . 

This  difficulty  prevents  adequate  treatment  of  the  denominator  of  the 
expression  following  the  first  inequality  in  (6.24). 
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There  is  one  situation  in  which  we  can  report  results  which  back 


Figure  6.  1.  Two  different  configurations  for  the  rectangle 
0 < x < a,  0<y<b  . 
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Hiving  set  this  up  we  take  p ' 0 but  small  and  consider  (6.29),  (6.  30) 
(6.  il)  to  be  a perturbation  of  (6.29),  (6.  30)  and 


— (x,  y,  t)  = 0,  x = a or  y b,  t > 0 . (6.32) 

Now  for  (6.  29),  (6.  10),  (6.  32)  the  eigenfunctions  of  the  infinitesimal 
generator  of  the  relevant  semigroup  on  V ^ (S2)  - V^([0,  a]  x[0,b])  can  be 
computed  explicitly.  There  are  infinitely  many  eigenvalues  o-,  all  of 
finite  multiplicity,  constituting  a set  2 , which  lie  on  the  imaginary 
axis.  There  is  also  the  eigenvalue  0,  and  it  is  of  infinite  multiplicity, 
the  associated  eigenspace  consisting  of  states  (w,  v)  with  v = 0 and 
w a solution  of 

a2  s2 

9 w 9 w 

z~j  f rr 

9x  9y 


w(x,  y)  = 0,  x = 0 or  y = 0 ( r ) 


w(x,  y)  = g(x,  y),  x = a or  y = b 


(r2> 


with  g e H 2(T^)  . This  is  the  situation  for  p = 0 . 

As  we  allow  p to  increase  from  zero  it  turns  out  that  the  eigen- 


values originally  in  2 move  to  a set  2^  with 


E C {z  € d|Re(z)  < -6  } , 

r P 

6 being  greater  than  zero  when  p > 0 . This  is  not  astonishing  and 
it  is  precisely  what  we  want.  More  surprising,  perhaps,  is  the  fact 
that  the  eigenvalue  0 of  infinite  multiplicity  passes  into  a countably 
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infinite  set  Z ^ of  real,  negative  eigenvalues  which,  for  any  (3  > 0,  no 
matter  how  small,  have  - as  their  only  limit  point.  As  a consequence 
it  is  seen  that  (6.29),  (6.  30),  (6.  31)  cannot  be  time  reversible  when 
(3  > 0 - the  process  is  dissipative  in  an  essential  way,  just  as  heat 
conduction  processes  are. 

However,  the  fact  that  all  system  eigenvalues  for  (6.29),  (6.  30), 
(6.  31),  p > 0 but  small,  are  uniformly  displaced  into  the  left  half  plane 
allows  one,  after  certain  analysis  of  the  eigenfunctions  also,  to 
establish  that  for  this  system  one  has 

E(w(-,t),  v(- , t))  < Ce'f  ' E(w(- , 0),  v(- , 0)),  t>0 


with  C,  6 > 0 and  independent  of  the  particular  state  (w(-  , 0),  v(-  , 0))  . 
That  is,  we  have  uniform  exponential  decay  of  energy  in  this  case. 

The  results  concerning  decay  of  solutions  of  the  wave  equation  in 
bounded  subsets  of  Q referred  to  earlier  in  this  section  are  usually 
obtained  now  (this  was  not  true  originally;  see  e.  g.  , [70])  with  the  use 
of  the  method  of  geometrical  optics.  We  have  some  hope  that  it  may  be 
possible  to  employ  this  method  to  study  the  decay  of  solutions  of  (6.  12), 
(6.  13),  (6.  14)  when  the  star-complementation  condition  is  satisfied. 
Careful  attention  will  have  to  be  paid  to  the  reflection  of  waves  and 
rays  from  the  energy  absorbing  surface  . One  aspect  of  the  problem 
indicates  caution.  The  method  of  geometrical  optics  is  basically  associ- 
ated with  wave -type  phenomena.  But  the  presence  of  real  eigenvalues 
tending  to  - ac  (the  set  Z^)  in  the  case  of  (6.29),  (6.  30),  (6.  31) 
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indicates  that  some  processes  which  are  not  really  of  wave  type  may 
well  be  operative  in  the  context  of  these  systems. 

Some  indication  that  the  star-complementation  condition  is  necessary 
may  be  obtained  by  considering  the  situation  displayed  on  the  right  of 
Tig.  6.  1,  where  we  again  consider  (6.  29)  but  now  suppose  that  (6.  30) 
holds  on  the  three  sides  there  indicated  as  r while  (6.  31)  holds  only 
on  the  single  side  x = a . In  this  case  it  is  shown  in  [79]  that  as  (3 
is  increased  from  0 the  set  of  eigenvalues  2^  is  not  uniformly  dis- 
placed from  the  imaginary  axis  - a subsequence  of  eigenvalues  asymp- 
totically approaches  it.  In  this  case  there  can  be  no  uniform  exponential 
decay  and  there  can  be  no  controllability  in  Vq(O)  (since  the  latter 
would  imply  the  former).  Whether  the  weaker  result  of  controllability  in 
Vp(fl),  which  is  all  we  have  in  Theorem  5.  3 even  with  star-complementa- 
tion, can  still  be  obtained  in  this  case  is  unknown;  it  does  not  seem 
likely. 

If,  by  any  means,  it  is  possible  to  prove  the  exponential  decay 
of  solutions  of  (6.  12),  (6.  13),  (6.  14)  it  will  lead  to  a comparable  con- 
trollability result. 

Theorem  6.  3.  If  for  some  (3  > 0 all  solutions  of  (6.12),  (6.13),  (6.14) 
have  uniform  exponential  decay  as  t -»  »,  i.  e.  , if  for  some  C,  6 > 0 

II  (w(- , t),  v(- , t))||  < C e"'>t||  (w  , v )||  , t>0  , (6.33) 

Vq(S2)  VQ(fi) 

then  the  system  (5.  25),  (5.26),  (5.27)  is  controllable  in  Vq(S7)  with 
controls  u c L2(F^  X [0,  T])  for  some  T > 0 . 
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Sketch  of  Proof . The  proof  again  parallels  that  of  Theorem  2.  9 and 

A A 

that  of  Theorem  5.  3.  One  begins  with  an  arbitrary  state  (wQ,  vQ)  in 
1 

V Q(0)  and  supposes  that  w(x,  t)  is  the  solution  of  (6.  12),  (6.13),  (6.14) 
with  this  initial  state.  Taking  note  of  (6.  33),  we  choose  T so  large  that 

C e'6T  h d < 1 . 


Then  we  let 


✓N  ' ' vv 

W[  = w{  - w( ■ , T),  Vj  = v1  - v(- , T)  (=  v2  - — (• , T)) 

where  (w^v^)  € V ^(O)  is  the  desired  terminal  state  to  be  achieved  at 
time  T . Then  we  let  w (x,  t)  be  the  solution  of  (6.  12),  (6.13)  and 


t-(x,  t)  -(3  — (x,t)  = 0,  x e r. , t < T 


(6.  34) 


W(-,T)  = w j,  ^(-,T)  - . 

Since  time  reversal  leaves  (6.12)  and  (6.13)  invariant  and  carries  (6.14) 
into  (6.  34),  we  have 

llw(.,T-t),  v ( • , T - 1)  ||  <Ce'6t||(w  v )|| 

VQ(S2)  VQ(n) 

Thus  letting 

wQ  = w(*  , 0),  vQ  = v(-  , 0) 

we  find  that  with 


9w  9w 

u(x,  t)  = 17  (*1^  + x « Tj,  0 < t < T 


(6.  35) 
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we  have  u steering  (w0  + wQ,  v + vQ)  to  {Wj,  v^)  during  [0,T].  The 
same  sort  of  contraction  fixed  point  argument  as  employed  in  Theorem  5.  3 
allows  us  to  put 


w„ 


W„  = W„, 
0 0’ 


v = v 
0 0 


where  (w^,  v^)  e v ^ ( 12)  is  the  given  initial  state. 

Now,  how  can  we  see  that  u t l/(  x [0,  T])?  We  do  not  get  this 

I 

from  the  theorem  of  traces;  that  would  still  give  u t C([0,  T] ; . 

What  one  observes,  rather,  is  that  the  energy  E(w,  v)  is  equivalent  to 
II  (w,  v)||  , (assuming  r is  not  empty)  and,  e.  g.  with  regard  to 

vo(n)~ 

the  solution  w(x,  t)  , 


E(w  (•  , 0),  v ( * , 0))  - E(w  (•  , T),  v (•  , T)) 


(6.  36) 


1 r r 9w  Z 

= p / /p(87  (X’t)}  d3dt  ’ 


the  equality  following  from  the  fact  that  w satisfies  (6.  34).  Since  for 

solutions  w in  Vg(fi)  the  left  hand  side  of  (6.  36)  must  be  finite,  the 

right  hand  side  must  be  also.  A similar  argument  applies  to  w and  then 
Z 

(6.  35)  gives  u « L ( Tj  X [0,  T])  . It  should  be  noted  that  we  have  no 
indication  that  u t C([0,  T],  L^(r^))  . 

With  this  we  conclude  the  present  section,  whose  purpose  has  been 
to  demonstrate  that  some  of  the  relationships  between  controllability  and 
stabilizability  which  obtain  for  finite  dimensional  systems  extend  to  the 
higher  dimensional  wave  equation  as  well.  Much  remains  to  be  done, 
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quite  clearly,  first  of  all  in  extending  Theorems  5.  3 and  6.  2 and, 
eventually,  in  constructing,  if  possible,  canonical  systems  as  in 
Section  4 to  investigate  the  spectral  assignment  properties  of  the  system. 
There  are  interesting  questions  connected  with  the  approximate  con- 
trollability result  of  Theorem  5.  2 as  well.  When  Tj  is  just  an  arbitrary 
non-empty  relatively  open  subset  of  9f2  we  know  (from  the  rectangle 
example)  that  we  cannot  expect  controllability  in  V q (^)  in  general. 

But  what  sort  of  controllability  do  we  have?  In  particular,  is  it  possible 
to  steer  the  system  state  from  (w(-  , 0),  v(-  , 0))  = 0 to  states  (w(-  , T)  , 
v(- , T))  = (4>k,  0),  (w(-,T),  v(.,T))  = (0,4>k),  where  <*>k  is  an  eigen- 
function of  the  operator 


Lw  = 


p(x) 


n 

V 


i,  1 = 1 9x 


. i.  , 9w 
( a.(x)  ; ) 

J 9xJ 


with  boundary  conditions 


w(x)  = 0,  x « r 


9w 

o’  ("ax 


(x),  A(x)  v(x)) 


= 0, 


X € 


ri? 


As  yet  we  do  not  know.  It  would  also  be  interesting  to  discover  how 
this  "eigenfunction  controllability"  is  related  to  spectral  assignment 
problems.  When  it  obtains,  is  it  possible  to  move  individual  points  of 
the  spectrum  via  state  feedback  without  affecting  other  points  of  the 
spectrum?  These  and  many  other  questions  wait  their  turn  for  treatment. 
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7.  CONTROL  THEORY  OF  PARABOLIC  SYSTEMS;  RESULTS  TOR  THE  CASE 
OF  ONE  SPACE  DIMENSION 

The  control,  observation  and  stabilization  theory  for  the  heat 
equation  is  very  different  in  character  from  that  of  the  wave  equation  and 
yet,  as  we  shall  see,  very  closely  related  to  it.  It  has  a somewhat 
longer  history  than  that  of  the  wave  equation.  In  his  book  [7],  A.  G. 
Butkovskly  devoted  a great  deal  of  attention  to  this  equation  and  notes, 
e.  g.  , application  to  problems  of  pre-heating  ingots  preparatory  to  rolling 
operations  in  the  metal  working  industries.  From  his  work  it  appears  not 
inaccurate  to  say  that  he  was  primarily  interested  in  optimal  control  and 
only  secondarily  with  questions  of  controllability.  A very  significant 
early  contribution,  from  the  viewpoint  of  controllability  theory,  is  due 
to  Egorov  [18],  [19].  Though  primarily  interested  in  the  problem  of  time 
optimal  control,  he  found  it  necessary,  as  part  of  this  study,  to  charac- 
terize a class  of  reachable  states.  Another  early  paper  is  due  to 
Gal’chuk  [30].  Though  specialized  to  the  problem  of  achieving  a final 
equilibrium  state,  it  presents  results  quite  comparable  to  those  which  we 
will  describe  below. 

A great  many  decisive  developments  in  the  controllability  theory 
of  the  heat  equation  are  due  to  H.  O.  Fattorini.  These  have  been  pre- 
sented in  numerous  articles  (see  e.  g.  [20]  - [26])  snd  form  a large  part 
of  the  "core"  of  the  existing  theory,  including  that  to  be  presented  here. 

We  also  make  reference  to  recent  papers  of  R.  Triggiani  ([107], 
[108]),  concerning  which  we  will  have  more  to  say  in  the  sequel. 
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We  have  seen  in  Sections  3-5  that  the  control  time  T plays  a 
very  decisive  role  in  the  case  of  hyperbolic  systems  in  determining 
whether  or  not  (exact  or  approximate)  controllability  obtains.  In  [20] 
Fattorini  showed  that  there  is  no  comparable  phenomenon  associated 
with  control  of  the  heat  equation.  Indeed  it  can  be  shown  in  many  cases 
that  the  set  of  points  reachable  in  time  T,  starting  with  the  zero  initial 
state,  is  completely  independent  of  T as  long  as  T > 0 . It  is  also 
possible,  in  the  case  of  the  heat  equation,  to  develop  extensions  of  the 
rank  condition  (2.4)  for  controllability,  for  which  no  "realistic"  counter- 
part exists  in  the  case  of  the  wave  equation.  This  question  is  treated 
by  Triggiani  in  [108].  We  shall  briefly  examine  these  questions,  and 
others,  in  the  context  of  the  theory  to  be  developed  below. 

We  will  develop  our  theory  on  two  levels  of  abstraction.  On  the 
one  level  we  shall  consider  an  operator  A on  a Hilbert  space  X which 
generates  an  analytic  semigroup  ([39]).  To  keep  the  exposition  simple  we 
shall  suppose  A to  be  an  unbounded  normal  operator  on  H with  compact 
resolvent  (XI -A)  ^ whose  spectrum  is  confined  to  a sectorial  region 

I arg(z  - zQ)  - tt  I < Gj  < tt  , (7.1) 

where  zQ  is  some  "base  point"  in  the  complex  plane.  The  holomorp’nic 
semigroup  is  then  defined  for  | arg  z | < tt  - 0^  . The  control  system  is 
x = Ax  + Bu,  x e X,  ueU,  (7.2) 

where  U is  a second  Hilbert  space  and  B;  U — X is  a bounded  operator. 
On  the  second,  more  concrete  level,  we  shall  consider  the  equation 
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(7.  3) 


9w  9^w 

— - — 7 f r(x)w  = g(x)  u(t),  t > 0,  0 < x < 

9x"“ 


with  the  function  r continuous  on  [0,1]  and  g e L [0,  1]  . The  boundary 
conditions  are 


8 w dvj 

a0  w(0,  t)  + bQ  — (0,  t)  = 0,  w(l,  t)  + bj  — (1,  t)  = 0 , 


(7.4) 


(a0)“  + <VZ  * °>  (a,)Z  + (bl)Z  * 0 • 


Closely  related  to  this  system  is  the  boundary  control  system 


9w  9^w 

— - — + r(x)w  = 0 , 
9x 


(7.  5) 


9w  9w 

aQw(0,  t)  + bQ—  (0,  t)  = 0,  ax  w(l,  t)  + bj  — (1,  t)  = u(t)  . (7.6) 


The  system  (7.  3),  (7. 4)  is,  of  course,  a special  case  of  (7.  2)  with 

2 1 ^ 

X = L [0,  1],  U = E and  A the  Sturm-Liouville  operator  Lw  = — - - r(x)w 

9x 

with  the  indicated  boundary  conditions.  We  shall  have  more  to  say  con- 
cerning specific  properties  of  this  operator  later. 

We  begin  with  (7.  2)  and  the  question  of  approximate  controllability. 
We  shall  analyze  this  problem  by  means  of  an  extension  of  the  rank 
condition  (2.4),  following  techniques  developed  by  Fattorini  [22]  and 
Triggiani  [108].  We  note  first  of  all,  taking  the  spaces  X,  Y,  Z and  the 
operators  C,  C as  in  (2.  15),  (2.  19)  and  S:  X -►  Z = X as  the  identity 
on  X,  that  approximate  controllability  of  (7.2),  i.e.  , the  question  as 
to  whether  the  range  of  C (cf.  (2.15))  is  dense  in  X is  equivalent  to 
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the  distinguish  ability  question  (Def.  2.  5,  part  (i))  which  in  this  case 
amounts  to:  does 

y(t)  = B e y = 0,  t « [0,  T]  (7.7) 

imply 

yQ  = o ? (7.  8) 

, * 

Since  B,  and  hence  B , is  bounded  and  since  A,  and  hence 

* 

A , generates  a holomorphic  semigroup  of  bounded  operators  the  function 

y(t)  extends  to  a holomorphic  U-valued  function  y(z)  in  the  sector 

(cf.  (7 . 1))  | arg  z |<  tt  - 0^  . It  then  admits  repeated  differentiation  to 

k A t 

give  (note  that  (A  ) e extends  to  a bounded  operator) 

B (A  )k  eA  tyQ  = 0,  t c [0,  T],  k = 0,  1,  2,  . . . (7.  9) 

So  the  above  question  may  be  replaced  by  the  question  as  to  whether  or 
not  (7.  9)  implies  (7.  8). 

It  is  really  not  possible  to  discuss  here  all  of  the  situations  in 
which  it  has  been  established  that  (7.9)  implies  yQ  = 0,  so  that  (7.2) 
is  approximately  controllable.  For  the  case  where  A is  an  unbounded 
self-adjoint  operator  on  the  Hilbert  space  X this  analysis  was  carried 
out  by  Fattorini  in  [20],  making  use  of  the  notion  of  the  ordered  repre- 
sentation of  such  an  operator.  In  [20]  a more  general  result,  where  A 
is  the  generator  of  a semigroup  in  a Banach  space  X,  is  obtained  in 
terms  of  the  spectral  sets  of  the  operator  A . A great  variety  of  theorems 
of  this  sort  are  discussed  and  presented  by  Triggiani  in  [108]. 
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In  order  to  give  some  notion  as  to  the  manner  in  which  results  of 

this  type  are  obtained  we  shall  consider  first  the  case  where,  in  place 

of  our  earlier  assumptions,  A is  taken  to  be  a compact  normal  operator 

on  the  Hilbert  space  X . We  shall  further  suppose  that  if  zero  is  an 

eigenvalue  of  A then  the  associated  eigenspace  is  finite  dimensional. 

Let  the  eigenvalues  of  A be  X^,  X , X ,,  . . . and  let  the  associated 

mutually  orthogonal  eigenspaces  in  X,  be  X^,  X^,X  , ...  each  spanned 

by  the  orthonormal  set  of  eigenvectors  . . . , <t>  , where  n. 

i,l  i,2’  i,  n.’  i 

is  the  dimension  of  X.  . Then  each  element  yQ  e X has  the  unique 


expansion 


n 

oc  i 

y = V V <£ 

° i = l j = l l’J  l'J 


Let  Uj,  u^,  u^,  . . . be  a basis  for  the  control  space  U . For  each  i we 


form  the  array 


i,l)(Ul-B  *i,2)- 

(U„  B <(>, 

1 i,  n. 

’ l 

* 

* 

i,  1>(U2’  ° 

<V  B *i,n. 

1 

(7. 10) 


•v  B\i,(v B*  \ 2>  ■ • • |uk'  B\n.’ 


Then  we  have  the  proposition  (adapted  from  [108]). 
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B 4>  = 0 

since  N is  included  in  the  null  space  of  B . But  B%  = 0 implies 
that  one  of  the  arrays  (7 . 11)  cannot  have  n,  linearly  independent  rows  - 
contrary  to  our  hypothesis.  We  conclude  N 1 = fo}  . But  N ^ consists 
°f  yQ  for  which  B (A  ) y^  = 0,  k = 0,  1,  2,  . . . so  we  conclude  the 
implication  (7.9)  =>  (7 . 8),  and  hence  (7. 7)  =»  (7.  8),  is  true,  giving 
approximate  controllabilit' ■ in  this  special  case.  The  complete  demon- 
stration of  the  equivalence  of  (i),  (ii),  ( i ii)  is  a standard  exercise. 

Naturally  one  wishes  to  extend  these  arguments  to  the  case  of  the 
original  generator  A whose  resolvent  was  assumed  compact.  This  is 
discussed  by  Triggiani  in  [108]  and  by  Fattorini  in  [21],  [22].  In  [20] 
Fattorinl  shows  that  approximate  controllability  of  (7.2)  is  equivalent 
to  approximate  controllability  of 
dx  -1 

= (kl  - A)  x + Bu,  X.  € p(A)  . 

Assuming  this  result,  Proposition  7.  1 lends  to 

Theorem  7.2.  Let  the  resolvent  ( XI  - A)  ^ ^of_  A be  compact.  Then 
approximate  controllability  of  (7.2)  for  any  T>0  is  equivalent  to  the 
condition  that  the  array  (7.11)  should  have  maximal  rank  for  each 
i = 1,  2,  3, . . . 

In  general  these  results  do  not  extend  to  the  wave  equation  be- 
cause the  semigroup  (group,  actually,  in  that  case)  is  not  holomorphic 
and  (7.8)  is  not  equivalent  to  (7.7),  in  fact  (A*)k  eA  1 may  not  even 
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be  defined  on  the  whole  space  X - as  it  is  in  the  holomorphic  case.  A 


very  curious  paradox  arises  here,  however,  and  it  is  discussed  in  [108]. 
Consider  the  wave  equation 


.2 

d w d w 

— ~ - — 7 + r(x)w  = g(x)  u(t)  , 
9t  9x 


(7.12; 


with  boundary  conditions  (7.  1).  Let  the  Sturm-Liouville  operator  — 
92w 
9x2 


- r(x)w,  wdth  the  related  homogeneous  boundary  conditions,  have 


eigenvalues  -X^A.,,  -X^,  ...  and  associated  orthonormal  eigenvectors 
dj,  4>2,  . . . forming  a basis  for  L^fO,  1].  Then  we  have  the  expansion 


9 = Z gk  V 

k = l 

We  shall  suppose  no  g^  = 0 and,  in  addition,  that  the  g^  decrease  in 
magnitude  so  rapidly  that  g is  an  “analytic  vector”,  which  means  that 


V e±i(>"k)  1 


k=l 


gk  yo,k 


is  holomorphic  in  some  set  which  includes  the  real  axis  whenever 

OC 


y 

k=l 


(y 


0,  k' 


< oo  . In  this  case  the  degree  of  controllability  from  the 


standpoint  of  Section  3-5  is  very  weak  but,  curiously  enough,  for  the 
At 

semigroup  e associated  with  (7.  3),  (7.4)  the  conditions  (7.9)  and 

(7.7)  continue  to  be  equivalent  - not  as  a consequence  of  any  analyticity 
At 

of  e , but  as  a consequence  of  the  rapid  decrease  in  the  magnitudes 
of  the  g^  . As  a result  (7.  3),  (7.4)  is  approximately  controllable  for 
every  T > 0 for  such  "analytic  vectors"  g - even  though  the  results 
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parallel  to  those  of  Section  3-5  ([87])  apply  only  for  T >2  . Here,  for 
T < 2,  we  have  approximate  controllability  without  "eigenfunction  con- 
trollability" (see  end  of  previous  section).  Eigenfunction  controllability 

1 

is  equivalent  to  the  existence  of  biorthogonal  functions  for  the  e 

V 

(corresponding  to  the  e of  Section  4)  and  such  biorthogonal  functions 
do  not  exist  for  T < 2 . In  addition  to  being  of  interest  in  its  own  right, 
this  example  may  indicate  why  we  have  asked  the  question  concerning 
eigenfunction  controllability  at  the  end  of  Section  6. 

In  the  period  1968-1971,  roughly  speaking  a number  of  independent 
attempts  were  made  to  obtain  a more  detailed  understanding  of  the  con- 
trol and  observation  theory  of  the  heat  equation  on  a spatial  region  con- 
sisting of  an  interval  in  . It  is  interesting  that  the  control  and  ob- 
servation studies  were  carried  out  independently  with  the  researchers 
involved  not  fully  realizing  the  complete  equivalence  of  these  problems. 
It  is  also  true  that  the  results  obtained  were,  in  some  sense,  only  an 
" e -improvement"  over  those  obtained  earlier  by  Egorov  and  Gal'chuk 
([18],  [9],  [30])  though  a reading  of  the  various  papers  shows  that  a more 
precise  understanding  of  the  essential  nature  of  the  control  and  related 
observation  problems  was  developing. 

For  our  expository  treatment  it  will  be  sufficient  to  consider  the 
controlled  heat,  or  diffusion,  process  (7.  3),  (7.4)  with  the  same 
hypotheses  on  r,  g,  a„,  b , a ,b  as  set  forth  previously.  As  anticipated 
in  earlier  paragraphs,  we  let  - \^,  ^(x)  denote  the  eigenvalues  and 
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a2 
o vv 

associated  eigenfunctions  for  the  operator  — — - r(x)w  with  boundary 

bx 

conditions  of  the  form  (7.4). 

The  existence  and  uniqueness  theory  for  (7.  3),  (7.1)  with  initial 


and  expansion 


w(x,  0)  = w()(x),  wQe  L [0,1]  , 


w„  = Y w , <j>, 
0 0,  k k 


is  summarized  in  [27]  and  more  details  appear  in  [29]  and  other  standard 
references  on  partial  differential  equations.  Suffice  it  to  say  that  the 
solution  w = w(x,  t)  of  (7.  3),  (7.4),  (7 . 1 3)  can  be  expanded  as  series 
in  the  eigenfunctions  4>  (x),  convergent  in  L^[0,  1]: 


w( * , t)  = 2,  w (t)  0 , 

k = l k K 

and  the  w (t),  k = 1,  2,  3,  . . . can  be  seen  to  satisfy  the  first  order 
ordinary  differential  equations 


sr  +\wk  = 9ku<t)’  k = c 2. 3, . . . 


and  initial  conditions 


wk(°)  = w0>  k , 


(7. 16) 


where  the  w are  specified  by  (7.14)  and  the  g are  the  coefficients 

U j K k 

in  the  expansion  of  g : 


9k  ^ in  L"[ 0,1] 
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From  (7.  IS),  (7.  16)  we  have,  for  T > 0 , 

-XT  T -X  (T-t) 

w (T)  = e k w . f e g.  u(t)dt  . 

k u , k * 

For  the  moment  let  us  consider  the  null  controllability  problem  where 
we  want  w (T)  = 0,  k = 1,  2,  3 What  we  want  then  is  (cf.  Section  2) 


«(S)  C 9(C) 


(7.  17) 


where  S:  X 


~ Z - L^[0,  1 ],  Y = L^[0,  T] 

are  defined  by 

oo  -X,  T 

) = I e W0.k\  ’ 

(7.18) 

k=l 

‘Xk(T_t)  gk  u(t)dt)«t>R  . 

(7.19) 

oc 

s(T  w 

L-*  (. 

k=l 


k=l  0 

The  dual  observation  problem  involves  the  homogeneous  equation 


2 

8y  9 y 

at  " T 2 

9x 


+ r(x)y  = 0 


(7.20) 


with  boundary  conditions 


an  y(0,  t)  + b (0,  t)  = alY(l,  t)  + b^d,  t)  = 0 


and  observation 


<*(t)  = (y(-  ,t),g)  2 

L [0,1] 


(7.21) 


(7.22) 


For  continuous  (final)  state  observability  it  is  required  (Def.  2.  5)  that 
for  some  K > 0 

llu.ll  , > K H y( * , T)ll 

L [0,  T]  L^[0,1] 
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for  each  solution  y of  (7.20),  (7.21)  with 


y(-,o)  = v v0,k  in  • (7-23) 


Using  integration  by  parts  one  readily  sees  that  for  solutions  w,  y of 


(7.3),  (7.4),  (7.20),  (7.21)  respectively,  we  have 


(y(‘  0),  w( ' , T))  - ( y(  - , T),  w(-,0)) 

L [0,1]  L [0,1] 


T 

= / (y(T-t),  gu(t))  dt  . (7.24) 

0 L>,1] 


With  u(t)  = 0 and  (from  (7.18))  the  observation  that,  in  this  case 


w(-  , T)  = Sw(-  , 0),  we  infer 

s'"  y(-,0)  = y(-,T) 

and  with  w(-  , 0)  = 0 we  have  w(-  , T)  = C u so  (7. 23)  gives 

T 

(y(-,0),  Cu)  , = / (y(T-t),  g)  u(t)dt 

L [0, T ] 0 L [0,  T] 

= (cf.  (7.22))  = (w(T  - • ),  u) 

L [0,  T] 


and  thus 


C y(- , 0)  = w(T  - • ) . 


The  observation  problem  equivalent  to  null  controllability  is  to 


determine  whether,  for  some  K > 0 we  have 


||  w(T  - • )||  = II  c"  y(- , 0)||  > K ||  S y(-  , 0)||  = K ||  y( • , t|| 

L [0,T]  L[0,T]  L [0,1]  L [0,1] 
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(7.25) 


In  [27]  Fattorini  and  the  present  author  studied  the  problem  in  the  con- 
trol context  while  Mizel  and  Seidman  [67],  [68],  and  later  Dolecki  [14], 
studied  the  problem  in  the  observation  form  (7.  25). 

Just  as  in  Section  4 the  problem  reduces  in  the  end  to  a question 
of  linear  independence  of  exponential  functions  in  L^[0,  T],  but  now 

-v  v 

it  is  the  functions  e , k = 1,  2,  3,  . . . rather  than  e , - oo  < k < x , 

that  we  are  concerned  with.  This  is  an  old  problem  with  roots  in  the  theory 

of  Dirichlet  series.  In  [4  5]  Kaczmarz  and  Steinhaus  discuss  the 

t 2. 

Muntz-Szasz  problem  of  completeness  of  sequences  {e  } in  L [0,  T] 
(actually  they  take  T = 1 but  this  is  unimportant).  The  classical  result 
is  that  {e  ^ k1- 1 k = 1,  2,  3,  . . . } are  complete  in  L^[0,  T]  (but  are  not 
independent)  if 

00 

l — =« 

k=l  Rk 


while  if 


00 

y _L 

k=l  ^k 


< 00 


-V 


the  e are  independent  but  span  a proper  closed  subspace  of  L [0,T]  . 

oo 

Since  X = k + © (1)  we  have  Y — < » and  it  is  the  second  situation 
k k=l  \ 

which  applies  to  our  case.  In  this  case  it  has  been  known  for  some  time 
[45],  [99]  that  no  function  e lies  in  the  L [0,  T]  closure  of  the  re- 

-v 

maining  functions  e , t * k . We  refer  to  this  as  "strong"  in- 
dependence. It  is  clearly  equivalent  to  the  existence  of  biorthogonal 


functions. 


-1  18- 


r 


■ 


Let  E(A,  T)  denote  the  closed  subspace  of  L [0,  T]  spanned  by 

-y. 

i'e  | k = 1,  2,  3,  . . . } and  let  E(A,  k,  T)  denoted  the  closed  subspace 

‘S', 

spanned  by  {e  \!  ± k}  . For  each  k let  r be  the  closest  point 

-V 

to  e in  E(A,  k,  T): 

min  II  e k - r ||  , = II e k - F II  2 

r « E(A,  k,  T)  L [0,  T]  K L [0,  T] 


Elementary  Hilbert  space  theory  shows  that  with 

-X,  t 


pk(t)  = 


-rk(t) 


t € [0,T] 


k‘  ,|  2 

6 ’V1 


we  have 


-S' 


(P  , e ) z = 0,  ( * k 

L [0,T1 


(7.  26) 


-y  -S' 

(Pk,  e ) =(P,,  e -r  ) =1  (7.27) 

L [0,  T]  L[0,T] 

so  that  the  p,  form  a biorthogonal  set  in  E(A,T)  (with  the  inner  product 

2 -y 

induced  by  L [0,T])  to  the  functions  e 

To  study  the  question  as  to  whether  or  not  (7.25)  is  true,  we  note 

that 


II  S y(  • , 0)||  = II  y( • , T)l|  2 

L [0,1]  L [0, 1]  k=l 


» -X.  T 

= Z le  V 


0,  k 


(7.28) 


Now,  using  (7.23),  (7.25),  (7.26),  (7 . 27)  and  assuming  g^  * 0, 


k = 1,  2,  3 . 
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i i.  


-XkT  e k 

6 yo,  k = gk  yo,  k 

'XkT  T an  - X t 

= ^ / pk(t)(I  g,  e y0  f)dt 

yk  0 f= 1 ’ . 

-X,  T -XT 

k k 

= -“ (Pk,“)  2 (PJT,-),  C y(-,0))  2 

gk  L [0,T]  gk  k L [0,  T] 


< ^ p ||p,  || 

g,.  k t2 


, lie* ye,  0)||  2 

LlO,T]  L'[0,T] 


and  then  (7.  28)  gives 

'2XkT 

lls*vc,0)llz  < I 2 — II Pull2,  Hc%,.,0)|I22 

L [0,  L]  k=l  | g I L [0,  T]  L [0,  T] 

K 

Hence,  if  one  can  establish  that  there  is  some  M > 0 such  that 

-2 X.  T 

ao  k ? ? 

Y 5-  Up,  II  7 £ M (7.29) 

k=l  I gk  I k L [0,  T] 

then  K = — is  such  that  (7.25)  holds  and  we  have  both  observability 
M 

in  that  sense  and  null  controllability.  We  will  establish  (7.29)  under 
the  basic  assumption  (usually  satisfied  in  practice)  that,  for  some  y > 0 
and  integer  m > 0 , 


IgJ  £ Yk"  , k = 1,  2,  3, 


To  establish  (7. 29)  it  is  clear  that  one  must  have  bounds  on  the 


norms  of  the  functions  p^  . This  has  been  done  repeatedly  in  the 
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literature  of  the  past  ten  years;  usually,  but  not  always  ([62])  in  the 
context  of  control  or  observation  for  the  heat  equation. 

In  1962,  in  the  course  of  studying  the  time  optimal  control  problem 
for  the  heat  equation,  Yu.  V.  Egorov  obtained  in  [18],  [19]  a controllability 
result.  As  stated  in  [18]  it  appears  to  be  restricted  to  the  case  where 
r(x)  = 0 in  (7.  3).  It  is,  in  that  context,  equivalent  to  showing  that 

II  Pk  II  < Kq  exp(Ki  (cok  = ^ Xk  ) (7.31) 

for  an  arbitrary  e > 0;  K , Kj  positive  constants. 

In  1969  Mizel  and  Seidman  published  a paper  [67]  on  the  observation 
problem  for  the  heat  equation  on  a finite  interval,  obtaining  a result 
equivalent  to 

II  Pk II  1 KQ  expfKj  \k)  . (7.32) 

(From  their  references  it  seems  probable  that  they  were  not  aware  of 
Egorov' s result.  ) 

If  (7.  32)  is  used,  then  (7.29)  holds  for  T > Kj/2,  indicating  that 
the  control  and  observation  problems  are  solvable  for  such  T and,  insofar 
as  observation  is  concerned,  this  is  the  result  of  [67].  Mizel  and  Seidman 
were  skeptical  that  this  really  was  the  best  result.  If  (7.  31)  is  used  it 
becomes  clear  that  (7.  29)  holds  for  arbitrary  T > 0 . Mizel  and  Seidman 
returned  in  1972  with  a second  paper  [68]  and,  making  use  of  a result  of 
L.  Schwartz  [99]  which  implies  that  the  natural  restriction  map  from 
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E(A,  T,)  to  E(A,  Tj)  (T  > T^)  is  boundedly  invertible,  removed  the 
restriction  on  T,  showing  the  observation  problem  to  be  solvable  for 
arbitrary  T > 0 . In  fact  this  result  also  follows  from  the  work  of 
Fattorini  referred  to  earlier,  whose  theorem  on  the  time  invariance  of 
the  reachable  set  is  closely  related  to  Schwartz'  result  on  the  isomorphism 
of  EfAjTj)  and  EfA.T^)  . We  shall  have  more  to  say  on  this. 

In  1971  Fattorini  and  the  present  author  collaborated  in  an  attack 
on  the  controllability  problem  for  the  system  (7.  3),  (7.4),  aware  of 
Egorov's  result  [18]  but  not  of  the  cited  work  of  Mizel  and  Seidman. 

The  result  was  a paper  [27]  in  which  the  control  problem  was  solved  in 
two  different  ways  - but  with  essentially  the  same  result  both  times. 

One  method  involved  the  estimation  of  the  norm  of  the  biorthogonal 
functions  p^  introduced  above.  Following  the  results  in  [45]  and  [99] 
which  imply  that 


X.  30  \ JSL  \ 

,11  , < 42  TT  (u-f)/TT(i-^)  . 

L [0,  T]  j = l ) j = l 3 

j*k 


it  was  found  by  very  careful  estimation  that  for  KQ,  > 0 


||p  ||  < K exp(K  a)  ),  w = \fx  (7.33) 

U L [0,T]_  ° 1 ^ ^ * 

k = 1,2,  3,  ... 

and  it  was  seen  that  this  is  the  best  possible  result.  This  improves 
Egorov's  result  (7.  31)  by  "e"  and  again  shows  that  (7.29)  holds  for  arbitrary 
T > 0 . A second  approach  refined  results  of  R.  M.  Redheffer  [83], 
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using  the  Tourier  transform  to  obtain  biorthogonal  functions  q , not 

a -V  k 

lying  in  E(A,T)  but  still  in  L [0,  T],  such  that  (q  e ) = 6 

* L[0,T]  ' 

A /A 

and  an  estimate,  for  KQ,  > 0 


2 < K exp(K  w ),  w = 


L [0,  T] 


which  yields  the  same  results.  Since,  necessarily, 


(7.  34) 


k ~ 3,  . . . , 


Ul  2 l II  q.  II  2 

L [0,  T]  L [0,  T] 


the  result  (7.  34)  implies  the  result  (7.  33)  but  the  first  approach  allows 
explicit  identification  of  while  the  second  does  not.  This  "Fourier 
transform"  method  would  be  more  or  less  of  a curiosity  were  it  not  for 
the  fact  that,  with  slight  modification,  described  in  the  next  section, 
it  allows  one  to  see  that  each  controllability  result  obtained  for  the 
wave  equation  yields  a corresponding  result  for  the  heat  equation. 

We  have  noted  that  with  S and  C defined  as  above,  the  ob- 
servability result  (7. 2=4  is  equivalent  to  null  controllability,  i.  e.  , that 

2 2 

for  each  initial  state  w^  e L [0,  1]  there  is  a control  u e L [0,  T]  such 
that  the  resulting  solution  of  (7.  3),  (7.4)  satisfies 

w(-  , T)  = 0 . 

Each  of  the  above  results  bounding  II  P,  II  ? can,  however,  be 

L [0,  T] 

applied  to  identification  of  terminal  states  w(-  , T)  = w other  than  zero 

2 

which  can  be  achieved  using  controls  in  L [0,  T]  . For  this  part  of  the 
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problem  we  may  without  loss  of  generality  take  w(-  0)  = 0 . 
then,  if 


00 

w = V w * 

T & T,  k k 


From  (7.15) 


(7.  35) 


we  have 


"T .k-  /„  e 


T -\k(T-t) 


gk  u(t)dt  . 


(7.  36) 


Writing  u(t)  as  a formal  series  in  the  p • 

k 


u(t)  = T,  ^ 


k = l 


the  biorthogonality  property  (7.26),  (7.27)  of  the  p relative  to  the 

-V  . 

e gives 


w 


w 


T,k-gk^k  or  g, 


TJk 


For  the  given  final  state  (7.  35)  we  then  have  the  control 

°°  WT 

= l Pk(T't) 

k=l  yk 


(7.  37) 


and  w(-,T)  is  reachable  if  (and  only  if)  the  series  (7.  37)  converges  in 
L [0,  T]  . From  (7.  30)  and  (7.  33)  we  see  that  this  is  certainly  true  if 


'WT,  k'  - A0  9XP('A!  wk}’  k = l,  2,  3,  ...  , 


(7.  38) 


f°r  aq  > 0 ancl  > which  in  turn  is  true  if  the  desired  terminal 

i 

state  is  in  the  domain  of  the  unbounded  operator  exp(A^( -L) p)  (see  (7.6) 

ff-  )- 
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We  summarize  the  results  developed  above  in 
Theorem  7,  3.  Consider  the  control  system  (7.  3),  (7.4)  with  the  control 
'■istrioutii  m function  g satisfying  ( 7 . 30).  Then,  given  any  positive 
number  T,  an  initial  state 


wo  = Z.  w0,k  V 1 t°"  ^ 


(7.  39) 


and  a terminal  state 


w = V w * 

T T,  k k 


(7.40) 


such  that  '.v  belongs  to  the  domain  of  the  unbounded  operator 

~ z 

exP(Aj(-L)  ■ ),  there  is  a control  u e L [0,  T]  steering  a solution  of  (7.  3), 
(7.4)  from  wQ  to  w during  f 0,  T]  . Moreover,  the  positive  constant 
Aj  is  independent  of  T . 

For  definiteness  we  stipulate  that  (-L)2  is  defined  in  such  a way 
that  its  spectrum  {W|w£  = \ k = 1,  2,  3, . . . } lies  in  lo}U{\|- 
j < arg(\)  < 7 } . 

We  remark  that  it  is  known  ([27  ])  that  this  is  the  best  result  which 
can  be  obtained  if  we  insist  that  (7.  37)  converge  " absolutely" , i.  e.  , 


x , 
V T>  k 


I pH  < ^ 

L [0,  T] 


But  Theorem  7.  3 does  not  characterize  all  reachable  finite  states.  For 
L - tt,  r(x)  = 0,  b^  = bj  = 0,  g^  = — we  have  \ k and  the  initial 
state  w = 0 and  control  u(t)  = 1 yields  the  final  state 
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w 


,-hI 


-k  (T-t) 


T,  k k 


e ' ' dt 

1 r,  -kZT, 


= — [1  - e 

k3 


for  which  (7.  38)  is  not  even  "nearly"  satisfied. 

The  problem  of  boundary  control  of  the  heat  equation,  wherein 
(7.3),  (7. 4)  is  replaced  by  (7.  5),  (7 . 6)  is  shown  in  [27 ] to  be  reducible 
to  the  form  studied  above,  with  the  g^  bounded  and  bounded  away  from 
zero  if  b * 0 and  growing  like  k when  b^  = 0 . No  separate  treatment 
is  required. 

We  continue  this  section  by  proving,  for  the  special  case  of  the 
system  (7,  3),  (7.4)  considered  above,  the  theorem  on  the  time  invariance 
of  the  reachable  set,  referred  to  briefly  at  the  beginning  of  the  section. 
We  use  the  notation  of  (7.  17)  ff.  with  the  modification  that,  since  the 
whole  class  of  intervals  [0,  T],  T > 0,  will  be  considered,  the  control 
to  state  operator  will  be  supplied  with  a subscript,  viz.  , C , to  in- 
dicate the  control  interval  under  discussion.  The  question  of  controlla- 
bility concerns,  of  course,  the  range  of  . If  T^  > T^  then  clearly, 
since  we  can  take  u(t)  s 0 for  0 £ t < T^  - Tj, 

9(C  ) ^ »(C T ) • <7  • 

2 1 

From  the  duality  theory,  this  is  equivalent  to 

lie!;  yl!  1 Mjc*  yll,  y e Y , (7.42) 

LZ  1 

for  some  Mj  > 0 . 
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We  have  invariance  of  the  reachable  set  (from  the  zero  initial  state) 
provided  the  inclusion  in  (7.42)  can  be  reversed,  which  is  equivalent 
to  reversing  the  inequality  in  (7.-12). 


Now  for  the  system  (7.  3)  we  have 


“T(t)  = (C  w)(t)  (w(-,t),  g)  = Y g w (t) 

L [0,1]  k=l 

00  -\t 

= 9k  W0,t  6 ' ’ “ f0’T!  ’ 


(7.4  3) 


for  any  initial  state  w - V w 4>  . The  inequality  (7.42)  is  the 

i ■ ‘ -i  u , K K 

k-l  ’ 


same  as 


.11  > M |j  co  || 

2 L [0,T2]  1 L [0,  Tj ] 


If  we  can  establish  that  for  some  K.  > 0,  independent  of  w , we  h w 

2 0’ 


k„  II  > M Ik  || 

T’  ’ ~ 1 2 T2  L [0,T. 


1 L [0,  Tj] 


(7.44) 


we  have  our  result. 

Q 

To  this  end,  suppose  we  have  a sequence  k ] corresponding 

l 

to  initial  states  with 


f 


t " > ^ _ 2,  3, 

2 L TO,  T21 


7.4  5) 


while 


Um  = Ik!,  ||  =0 

f 1 L [0,  T] 


(7.46) 
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Let  Ip^}  be  the  sequence  of  elements  blorthogonal  to  the  e~  ^ in 
2 

L [0,1^1,  as  introduced  prior  to  Theorem  7.  3.  Thus 


-X, 


h1  = h 


Using  (7.  33),  (7.43)  we  have 

# 

l?kw0  J : IJ  w (t)  Pk<t)dtl 

< K exp( K \?  ) || w*  || 

2 L [0,T2] 

from  which  it  follows  from  (7.4  3)  that 

f i 

|oj  (z)  | < M , Re(z)  > p 

- p - 

for  any  positive  p . Applying  Montel's  theorem  [38]  we  conclude  that 

( l 

a subsequence  of  {oo  } which  we  will  still  call  {w  },  converges 
uniformly  to  an  analytic  function  w(z)  in  compact  subset  of  Re(z)  > p . 
Taking  p < T^  and  using  (7.46)  we  conclude  that  w(z)  = 0 . Then  (7.46) 
combined  with  the  uniform  convergence  on  [p,  T^  j gives 

lim  II J ||  = 0 , 

I 2 L [0,Tj 

which  contradicts  (7.4  6).  We  conclude  (7.46)  is  impossible,  which 
implies  (7.44)  for  some  M.,  > 0,  and  we  have  the  desired  result. 

We  remark  that  the  above  argument  is  essentially  the  same  as  that 
of  Schwartz,  referred  to  in  the  paragraph  following  (7.  32).  The  result 
has  been  extended  recently  by  Fattorini  [26]  to  cover  the  problems  of 
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b undary  introl  f higher  dimensional  parabolic  systems  which  we  will 
discuss  in  Section  8. 

Other  work  closely  related  to  what  we  have  described  so  far  in 
this  section  has  been  done  by  S.  Dolecki  [14].  He  is  concerned  with 
continuous  final  state  observability  of  (7.20),  (7.21)  in  the  case  where 
the  observation  (7.22)  is  modified  to 

w(t)  = y(x , t)  , 

A 

where  x is  a point  in  the  interval  [0,1]  . Making  use  of  number 

A 

theoretic  properties  of  x he  establishes  that  continuous  final  state 

A 

observability  holds  for  almost  all  x e [0,  1]  but  there  is  a dense  - 
subset  for  which  such  continuous  final  state  observability  does  not 

A A 

hold.  He  is  also  concerned  with  the  case  wherein  x = x(t)  varies 
with  time,  t . 

If  the  function  r(x)  in  (7.  3)  is  positive  and  in  (7.4)  either  a^ 
or  a is  different  from  zero  all  eigenvalues  -X  of  the  Sturm-Liouville 

1 ^ A 

operator  -Lw  = — - — r(x)w  are  negative  and  the  uncontrolled  system 
3x 

((7.  3),  (7.4)  with  u(t)  = 0)  is  asymptotically  stable.  Under  other 

circumstances,  however,  it  is  possible  that  finitely  many  of  the  -X^,  say 

for  k = 1,  2,  3,  . . . , K,  are  non-negative  and  some  sort  of  stabilizing 

control  is  called  for.  Even  if  all  -X,  are  negative  it  may  be  desired 

k 

to  move  some  of  them  further  to  the  left  in  order  to  improve  the  "settling 
time"  of  the  system.  This  type  of  problem  has  been  considered  by 
Triggiani  in  [107]  and  also  by  Sakawa  [98].  Letting  P and  Q be  the 
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orthogonal  projections  onto  the  subspaces  of  L [0,1]  spanned  by 

X.,X  , . . . , X - and  Xr>  . , X~  ,,  respectively,  and  now  reverting 

1 2 K K+l  K.+Z 

to  the  abstract  system  (7.2),  we  obtain 

Px  = PAx  + PBu  = APx  + PBu  , 

Qx  = QAx  + QBu  = AQx  + QBu  , 


which,  with  an  obvious  notation,  we  can  represent  by 


(7.47) 


If  this  system  is  to  be  even  approximately  controllable  the  system 

x = Aj  X1  + BjU 

(which,  at  least  insofar  as  x1  is  concerned,  is  finite  dimensional)  must 
be  controllable.  One  can  easily  see  then  that  there  is  a linear  feedback 
law 

u = Kx1  (7.48) 


such  that 

x1  = (A1  + B^x1 

has  any  desired  eigenvalues  p^,  p^,  • • • , in  place  of  the  eigenvalues 

-X.,  -X  , ...,  - Xa  . The  system  obtained  by  substituting  (7.48)  into 
12  K 

(7.47),  viz.  , 


.1  \ 

f A,  + B.K  ' 

0 \ 

/ 1 ^ 

x 1 - 

l l ; 

x 1 

• 2 ) 

I 

A,  ) 

2 

* / 

ib2k  : 

2/ 

Vx  ) 
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clearly  has  eigenvalues  N^,  ^2>  • • • > H-g»  -x£fI>  _Xk+2’  ' ' ‘ and  is 
asymptotically  stable  if  the  p . , i = 1,  2,  . . . , K have  negative  real 
parts. 

The  above  result,  which  shows  that  finitely  many  eigenvalues  can 
be  moved  at  will  without  moving  the  others,  can  be  strengthened  by  a 
variant  of  the  methods  used  in  Section  4.  In  [96]  it  is  shown  that  for 
the  system  (7.  3),  (7.4)  one  can  achieve  any  desired  eigenvalues 
P2,  P3,  • • • with  (cf.  (4-  65)) 


oc  I \ - u | 2 

V J_1  < oc 


J=1  3gi 


by  use  of  feedback 

u(t)  = (w(-  , t),  k) 

L [0,1] 

2 

in  (7.  3),  (7.4),  ke  L [0,1]  being  appropriately  chosen. 
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-war 


8.  HIGHER  DIMENSIONAL  PARABOLIC  SYSTEMS;  HARMONIC  ANALYSIS 
Of  BOUNDARY  VALUES  Of  SOLUTIONS 


Our  next  objective  is  to  study  the  heat  equation  which  "corresponds1 


to  the  wave  equation  (5.  1)  introduced  in  Section  5 
n 


9w  v 

oW-aF  - 


9 i 9w 

: (o'.  (X)  ) = 0,  t > 0,  X € £2  , 

i j=l  dx1  1 dX 


(8.1) 


with  d a1,  D having  the  same  properties  as  in  that  section.  In  par- 

j 

ticular 

9D  s r = rQ  U Tj 


and  we  suppose  control  to  be  exercised,  through  the  boundary  conditions, 
only  on  the  portion  of  the  boundary.  For  simplicity,  and  to  achieve 
correspondence  with  our  work  in  Section  5,  we  shall  suppose  the  boundary 
conditions  are 

w(x,  t)  = 0,  x € rQ,  t > 0 , (8.  2) 

(f|  (x,  t),  A(x)  v(x))  n=f(x,t),  xe  Tj  t>0  , (8.3) 

f,  required  to  lie  in  L^(T^  x[0,  t ])  for  any  T > 0,  being  our  control 
function.  An  existence  and  regularity  result  for  (8.  1),  (8.  2),  (8.  3)  is 
referenced  in  [28]. 

The  controllability  - observability  duality  relationship  is  important 
here,  as  always.  Letting  z = z(x,  t)  satisfy  equations  of  the  form  (8.  1), 
(8.  2),  (8.  3)  but  with  f = 0 we  have,  for  any  t>0,  0<t<T, 
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f w(x,  t)  z{x,  T-t)  P(x)dx 


-/  I (X.t)  Z(X,T-t)-W(X,t)  V -ib'w®  (1,1. 

a i,i  = i Sx‘  1 Sx‘  ) 8X1 

L 

= / div  (A(x)f|  (x>  l»  z(x>  T-t)  - w(x,  t)  (A(x)  |^-(x,  T-t)l  dx 


r a w 

J (_ax(x»t)»  A(x)  V'(x»  z(x,  T-t)  - w(x,  t)  (—  (x,  t),  A(x)  v(x))  ds 
3q  c'x 


= / f(x,  t)  z(x,  T-t)ds 
“F1 


(8.4) 


where  s denotes  surface  measure  on  dQ  . The  integrated  form  of  this 
is,  for  t > 0 , 

/ w(x,  x)  Z(x,  0)  p(x)dx  - f w(x,  0)  z(x,  r)  p(x)dx  = 
ft  n 


f f f(x,  t)  z(x,  T-t)ds  dt  . 

0 Q 

Following  the  schema  of  Section  2,  we  let  X = Z = L2(ft),  which  is  just 

2 P 
L (n)  with  the  "weighted"  inner  product 


f w(x)  w(x)  p(x)dx 


(w,  w)  2 = J 

L (ft)  n 


and  we  let  Y = L2(r  X [0,  t ] ) . We  define  S:L2(tt) 

1 P 


L (ft)  by 
P 


SwQ  = w(.  , t) 


(8.  5) 
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where  w is  the  solution  of  (8.  1),  (8.  2),  (8.  3)  with  f - 0 .vhich  also 

satisfies 

~ 2 

w (x,  0)  = w (x)  « L (ft)  (8.  M 

’ 0 p 

and  we  define  C = L ( r.  X [0,  t])  — L (ft)  by 

1 P 

Cf  = w (•  , t)  (8.  7) 

where  w is  the  solution  of  (8.  1),  (8.  2),  (8.  3)  with 


w (x,  0)  = 0,  X £ ft  . 


The  condition  9(C)  D 3 (S)  corresponds  to  controllability  of  ( 8.  1),  (8.  2), 
(8.  3)  from  an  arbitrary  initial  state  wQ  £ L^(ft)  to  the  zero  terminal  state 
while  9(C)  3 9 (S)  corresponds  to  approximate  controllability.  We  easily 

see,  just  as  in  Section  7,  that 

s’  z Q = z(-  , t)  (8.  8) 

where  z satisfies  (8.  1),  (8.  2),  (8.  3)  with  f = 0 and 

Tj  X[0,  t]  (8.9) 


c z. 


the  right  hand  side  of  (8.  9)  denoting  the  restriction  of  the  solution  to  the 

r -i  nn+l 
surface  x[0,  t]  in  R 

Via  duality,  the  condition  for  approximate  controllability  becomes 
the  distinguishability  condition 


C z0  = z 


TjXtO,  r] 


* 0 * Z0  = ° 
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This  has  been  studied  by  Lions  in  [57]  with  the  aid  of  a uniqueness 
theorem  of  Mizohata  [69]  which  plays  the  same  role  here  as  the  Holmgren  - 
John  uniqueness  result  of  Section  5.  The  result  is 

Theorem  8.  1.  The  control  system  (8.  1),  (8.  2 ),  (8.  5)  is  approximately 
controllable  in  time  t ]f  t is  any  positive  number  and  r]  is  an 
arbitrary  non-empty  relatively  open  subset  of  917  . 


We  remark  that,  since  the  range  of  S is  dense  in  this  case  there 
is  no  ambiguity  as  to  the  meaning  of  approximate  controllability.  (See 
the  remarks  in  the  paragraph  preceding  (2.  16)  in  Section  2.) 

To  proceed  further  than  this  it  is  necessary  that  we  carry  out  a 
more  detailed  analysis.  It  is  known  ([2],  [10])  that  the  operator 


1 y d i Bw 
Lw  = --  > — :(a.(x) — r) 

P i,j  = l dx  1 dx 


with  homogeneous  boundary  conditions  of  the  form  (8.2),  (8,  3)  (with 

2 

f = 0)  is  an  unbounded  self-adjoint  operator  in  L f 17)  having  eigenvalues 

P 


0 < X < 


Xk+1  < 


with  lim  X = <*  . (Actually  there  is  an  eigenvalue  X = 0 with  is 


empty,  an  important  case,  but  to  avoid  confusion  we  will  exclude  this 
case  from  our  general  discussion  here).  Each  X^_  has  finite  multiplicity 
m^  and  corresponding  eigenfunctions 


m 


k’ 
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can  be  selected  so  that  they  form  an  orthonormal  basis  for  L (T2)  . 

P 

Each  element  w e lZ(i2)  has  an  expansion 
P 

00  mk 

w = z z wk  f ^k  f 

k=i  rii  ’ ’ 


with 


„ m, 

00  k 


wv  . = / f w w<x)  p<x)dx»  Z I I 

r'>  e V>  V = 1 » -1 


w, 


< 00  . 


k=l  f=l 

If  we  let  w(x,  t)  be  a solution  of  (8.  1),  (8.2),  (8.  3)  with  initial  state 
(8.6)  having  the  expansion 

m 

«"  k 

w = V V w d> 

o ^ 0,k,l  k,f 


and  likewise  expand 


m 

oo  k 


w(’,t)  = Z Z wk  ,<*>  \ x ’ 

k=l  1=1  k,t 

- v 

then  by  choosing  z(x,  t)  = ^ (x)  e (a  solution  of  (8.  1),  (8.  2), 

(8.  3)  with  f = 0),  the  identity  (8.4)  gives 
' \T 

”k,f|T)  = c wo,k,( 


If  4>  (x)  e 

o r k,f 


Ak(T"^  f(x,  t)ds  dt  , 


k = 1,  2,  . . . , f = 1,  2,  . . . , m 


(8. 10) 


If  the  initial  and  terminal  states  wfl  and  w(-,t)  have  been  determined, 
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(8.  10)  constitutes  a moment  problem  for  the  unknown  control  function 

) 

f « ( r j X [0,  t ])  of  much  the  same  character  as  (7.  36)  in  the  preceding 

section.  Being  able  to  solve  (8.  10)  again  depends  on  finding  functions 

q,  (x,  t)  biorthogonal  with  respect  to  the  functions  p (x,  t)  = 
k,  ( _ k,  l 

(x)  e ' . Once  found,  (8.10)  has  the  formal  solution 

* 

HI,  X 

■ \T 

f(x,t)  = ^ H (wkf(T)-e  w0>  k>  t (x>  T-t#q]c  j (*,  T-t)  (8.H) 

and  one  can  proceed,  via  estimates  on  ||q.  || , to  determine  when 

t > * 

2 r 

this  series  converges  in  L ( I’  X[0,t|)  . 

For  special  pairs  £2,  and  p(x)  = 1,  A(x)  s I,  this  analysis 
has  been  carried  out  directly.  In  [25]  Fattorini  studies  the  case  where 
£2  is  a multi-dimensional  rectangle  ("  parallelopipedon" ) and  is  one 
face.  That  work  is  carried  out  in  the  controllability  context.  In  [68] 

Mizel  and  Seidman  obtain  equivalent  results  from  the  standpoint  of  ob- 
servability. In  [28]  Fattorini  and  the  present  author  have  considered 

the  case  wherein  12  is  the  unit  ball  in  Rn  and  F = 9t2,  the  unit 

1 

sphere.  In  both  cases  it  was  possible,  by  careful  study  of  elementary 
or  special  functions  to  obtain  explicit  bounds  on  the  biorthogonal  functions 
q,  . The  results  of  Theorem  7.  3 remain  valid  in  these  cases  with  L 
taken  as  the  Laplace  operator  in  £2  with  appropriate  boundary  conditions. 
In  [102]  Seidman  obtains  results  for  the  case  where  £2  is  a region  in  R1" 
lying  between  two  circles  centered  at  the  origin  and  between  two  rays 
out  of  the  origin.  We  shall  have  more  to  say  about  this  last  result. 
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For  a general  configuration  ft,  (or  " setting" , as  Seidman  calls 


it,  [10.2])  the  only  controllability  results  that  we  have  (other  than  the 
approximate  controllability  results  of  Theorem  8.  1)  come  to  us  indirectly, 
either  via  the  heat  equation  in  a larger  "regular"  region  ([100]  or  via  the 
wave  equation  results  of  Section  5 followed  by  the  Fourier  transform. 

In  [100]  Seidman  has  developed  an  "imbedding"  technique  applicable 
to  the  system 


9w 

"FT 


n 

V 


,2 

o w 


k=l  (9xK)2 


= 0. 


x e 


D 


t € [0,  T ] , 


(8. 12) 


»w(x,  t)  + (x,  t)  = f(x,  t),  x € r,  te  [0,t]  , (8.13) 

Here  ft  is  a bounded,  open,  connected  region  in  Rn  with  smooth 

boundary  r and  unit  outward  norm  v = v(x)  defined  for  x e r . This 

being  the  case,  we  proceed  with  Seidman  [100]  to  imbed  ft  in  a larger 

region  ft^  for  which  the  control  problem  for  (8.12),  (8.13)  has  already 

been  solved.  Then,  given  the  initial  state  w , we  extend  w to  a 
’ 0’  0 

g 2 6 

function  w^  e L (R)  (e.  g.  , by  setting  w^(x)  = w^(x),  x e ft,  and 

= 0,  x e R - ft)  . Let  R(Qq,  t)  denote  the  set  of  states  known  to  be 

reachable  for  the  system  (8.  12),  (8.  13)  (but  now  in  ftg  rather  than  ft  ). 

_ e 

Given  w € R(ft  , t)  we  select  a control,  call  it  f , which  steers 
t e’  ’ e’ 

3 6 

(8.12) ,  (8.13)  (in  ft  ) from  w„  to  w . Let  the  resulting  solution  of 

e 0 t 

(8.12)  in  ft  X [0,  t ] be  called  wG(x,  t)  . Since  9ft  C ft  U 9ft  , 

e 1 ’ 1 ’ — e e 


setting 
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.J 


we  ran  use  standard  uniqueness  and  regularity  results  to  see  that  the 

control  f thus  derived  steers  (8.  12 ) , (8.13)  (in  £7  now)  from  wQ  to 

e 

w restricted  to  £2  . This  result  can  be  expressed  oy 

R(n,  t)  2 R(ne,  r)  I . 

£2 

Commonly  one  takes  £2p  to  be  a ball  in  Rn  enabling  one  to  use  the 
controllability  result  in  [28]  or,  equally  well,  the  observability  result 
in  [68],  For  such  an  £2^  essentially  the  result  of  Theorem  7.  3 remains 
valid,  as  demonstrated  in  [100]. 

These  imbedding  methods  suffer  from  the  limitation  that  the  control 
must  be  applied  along  the  whole  boundary  unless  we  have  F = flr^ 
with  rQ  so  configured  that  repeated  reflections  produce  a larger  region 
whose  boundary  consists  entirely  of  F^  and  its  reflected  images.  This 
means,  effectively  that  r consists  of  two  line  segments  meeting  at  a 
point  with  the  angle  between  them  equal  to  Zn/m  for  some  appropriate 
integer  m . 

We  begin  the  rest  of  the  work  of  this  section  with  an  appreciation 
of  what  Theorem  5.  3 means  in  terms  of  the  harmonic  analysis  of  the 
functions 

iw^t 

= *|k|,f(x)  e ’ xe  ri>  te  [°>T1  - 
k = ±l,  ±2,  ± ',...,  ?=1,  2,  ...,m|^|. 
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which  are  restrictions  to  F x[0,T]  of  “eigenfunction"  solutions  of  the 
system  (5.  25),  (5.  26),  (5.  27)  with  u = 0.  We  are  supposing  that  T > 
(cf.  Theorem  5.  3)  and  we  are  adopting  the  notation  = n/'x^  , 
k 1,  2,  3,  ...  , - W X , k = -1,  -2,  -3,  . . . . The  numbers  -X^ 

ire  the  eigenvalues  of  the  Laplacian  in  S2  with  homogeneous  boundary 
conditions  of  the  form  (5.  26),  (5.  27).  With  4>  , k = 1,  2,  3,  . . . , 

K,  t 

f = 1,  2,  ....  m being  the  corresponding  orthonormal  eigenfunctions 
k 

that  states 


*k,  t 2 (iwk  ° | k | , t 


|k|,  r 


±1,  ±2,  ±3, 


( - 1,  2,  m|]^|  > 

form  a complete  orthonormal  set  in  the  space  V ,(S2)  = (cf.  (5.9)) 
equipped  with  the  energy  inner  product 


(8.15) 

Z 


< ((w,  v);  (w,  v)))  E = 


/»  A vh  0W  . , 

j (v v + L ~ ] -])dx  ■ 

£2  j = l 9xJ  9xJ 


Using  (5.  17)  (modified  for  complex  solutions)  with 


(w(-,0),  v(*  , 0))  = 0,  (w(-,T),  v(-,T))  = *£  * 

k,  f 


(8.16) 


-iw,  T 

K 


(y(-,T),  z(-,T))  = 3^  ,,  (y(  * , 0),  z(-  , 0))  = e ®k>| 


and  letting  q = q (x,  t),  X e l'  te  [0,  T]  be  a control  (cf.  (5.27)) 

in  LZ(T1  X [0,  T]  which  solves  the  control  problem  (5.  2 5),  (5.26),  (5.  27) 
( 8.  16)  we  find  that 
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) 2 

L (r}  X[0,  T]) 


(8.  17) 


so  that  the  q,  form  a biorthogonal  set  relative  to  the  p,  in 
k,  ( 

L2(T  x[0,T])  . This  result  can  be  compared,  qualitatively,  with  the 
results  on  nonharmonic  Fourier  series  in  [7  3],  [56],  [99],  [96]  (see  [87] 
for  summary)  and  is  obtained  by  what  is  seen  to  be  the  method  described 
very  briefly  following  (4.2  5)  in  Section  4.  We  remark  that  one  can  show 
by  methods  analogous  to  those  referred  to  in  connection  with  Theorem  5.2 


that  there  is  a TQ  < Tj  such  that  the  control  problem  is  not  solvable, 

and  hence  biorthogonal  functions  q^  f do  not  exist  in  L (F^  x[0,  I])  , 

if  x < Tn  — again  similar  to  the  above  noted  results  in  nonharmoni 

Fourier  series.  The  positive  result  for  T > Tj  is  only  qualitatively 

similar  to  the  classical  results  in  nonharmonic  Fourier  series  because  we 

c mn  >t  establish  (even  though  we  do  conjecture  it  to  be  true)  that  the 

2 

d and  q are  Riesz  bases  for  the  subspaces  of  L (T  x[0,T]) 
k, f k,f  1 

which  they  span.  (These  subspaces  can,  w.  l.o.g.  , be  assumed  identical 

2 

by  taking  the  q,  to  be  controls  of  least  L (F.  X [0,  T])  norm.  See 
k,  l 

[57]  for  existence  theorems  in  this  connection.  ) What  one  can  establish 


(see  [92],  [93]  is  that  for  some  positive  integer  r and  some  M ^ 0 


5k,  I C (Fj  x[0,  T])  - M w|k  I 
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One  may  also  arrange  (as  in  [92]  that 


qM(x’t)  = a-M'x’t)  ’ 


as  one  would  expect  from  (8.  15)  and  :he  definition  of  k = -1,  -2,  -3, 

We  will  assume  this  has  been  done. 

Let  functions  f(x,  z)  be  defined  for  x e and  z in  the 

complex  plane  via  the  Fourier  transform 


A n - i 7 1 /s 

Gk  ( (X’  Z)  ” / e qk  t (X’  T't)dl  ' 


The  G,  (x,  z)  are  of  class  r in  x and  entire  in  z . Moreover,  one 
k,  l 

may  show  very  easily  that  the  G (x,  z)  are  of  uniform  exponential 

k,  i 


type; 


i a i . , i z I T r 

lGk  i e u|. 


(8.18) 


for  some  > 0 . 

The  biorthogonality  relation  (8.  17)  now  becomes  (after  a trivial  change 


of  variable) 


/ ‘hki,^  gm  (x’  ^)ds  = 6m  • 


(8. 19) 


k,  k = ±1,  ±2,  ±3,  ...  , (,f  = 1,  2,  . . . .mu,,  m.£. 


Now,  using  only  the  G for  k = 1,  2,  . . . (no  negative  values  of  k) 

k,  f 

and  l = 1,  2,  . . . , m we  find,  by  setting 

5M,x’z)-SM(x'z|taM(x’  -z) 
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that  we  can  obtain  a function,  again  with  growth  like  (8.  18;,  but  now 
m even  function  of  z . Moreover,  (8.19)  gives 


\,«(x)  6k.?{><’:t‘“k,dl<  = 5M 


k,  E = 1,2,  ), 


f , f 


1,2,  . 


mk’  mk 


(8.  20) 


Since  G is  even  we  may  define 


. 2 


Gk  f (x,  -i  z ) = G k ^ (x,  z),  k = 1,  2,  3,  . . . , I = 1,  2,  . • • , 


and  obtain  again  a function  of  class  C with  respect  to  x and  entire 
in  z,  now  with 


lGM(x,z)|  12M0e'Z  >T 


(8.  21) 


Then  (8.20)  becomes 

/V  A 

5M  Gk,f 'IKk’dS  ' <8'“’ 

ri 

The  next  step  would  be  to  infer  that  Gf  Mx,  z)  is  the  Fourier  transform 

k,  f 

of  some  appropriate  function.  But  here  we  are  stopped  momentarily  be- 
cause (8.  21)  indicates  Gp  * (x,  z)  to  be  the  Fourier  transform  of  a 

k,  f 

distribution  with  support  at  t - 0 - not  a bona  fide  function  at  all.  This 

situation  is  remedied  with  use  of  a theorem  [83]  of  Redheffer.  G £ £ (x,  z) 

is  multiplied  by  an  entire  function  E (z)  designed  so  that  E (z)Gp  *(x,  z) 

^ 2 

is  the  Fourier  transform  of  a function  qp  * /x  t)  in  L*~(r,  & [0,  r])  - 

k,  f I 

this  can  be  done  for  any  t > 0 . Then  (8.  22)  reads 
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■a 


Et'-‘V  =/ 


r. 


k.  t 


(X)  E ( -i  \ ) Go  ~ (x  -i  \ ) ds 
t k k.  f k 


= //  %.,<*> 


-V 


o r. 


Oa  a (x,  t)  d s dt 
k,  l 


‘ ,pk.r  qU'k 


’ l <r.  x[o,t|) 


where  the  definition  of  p,  , should  be  clear.  Hence 

k,  f 


E ( - i V ) 
t k 


satisfies  the  biorthogonality  relation 


(pm>  qu 


l ( r{  x[o,t] 


= 6 


k,  f 
k,  ( 


The  moment  problem  (8.  10)  then  has  the  formal  solution  (8.  11).  By  careful 
estimation  of  the  numbers  E (~i  X.  ) one  can  establish 

J 2 1 K exp(K  u ) , (8.2  3) 

L (rjXfO,  r]  0 1 k 

k - 1,  2,  3,  ...  , l — 1,  2,  . . . , 

just  as  in  (7.  33).  Theorem  7.  3 is  then  valid  (with  the  obvious  modifica- 
tions) for  the  control  system  (8.1),  (8.2),  (8.  3)  with  p(x)  s 1,  A(x)  = I 
and  the  pair  ( S2,  r^)  star-complemented.  This  last  point  should  be 
re-emphasized  - the  controllability  result  just  obtained  applies  only  to 
star-complemented  pairs  (n,  r ),  since  that  assumption  is  vital  in  our 
proof  of  Theorem  5.  3 which,  in  turn,  provides  the  formulation  for  the 
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result  just  obtained.  Now  the  example  in  Section  5 of  the  rectangle  with 


control  applied  on  only  one  side  shows  that  controllability  of  finite  energy 

2 

states  with  controls  u € L (Fj  X [0,  t ])  cannot  be  generally  expected 
tor  the  wave  equation  without  the  condition  on  star  -complementation 
(or  (see  [72],  [106])  some  other  condition  which  guarantees  that  the 
inactive  portion,  F , of  bQ  does  not  "trap  waves".  Because  wave 
fronts  travel  along  straight  line  "rays"  ([11])  in  the  case  of  the  wave 
equation  this  is  not  surprizing.  But  one  can  make  no  convincing  case 
that  such  geometric  conditions  are  in  any  way  necessary  for  parabolic 
equations  modelling  heat  or  diffusion  processes.  A recent  example  due 
to  Seidman  [102]  shows,  in  fact,  that  parabolic  processes  may  exhibit 
a significant  degree  of  controllability  even  when  the  pair  n,  Fj  is  not 
star -complemented. 

To  explain  this  result  of  Seidman  we  use  a rather  less  general 
system  than  he  did  in  [102],  We  consider 


9w  9 w 9 w 

-jr 2 " — 7 = °»  0 , 0 < t < t,  (x,  y)  « Q ( 8.  24) 

9x  9y 

w(x,  y,  t)  = 0,  (x,  y)  € rQ,  w(x,  y,  t)  = f(x,  y),  (x,  y)  e r , 0 < t < r , 

(8.  26) 


where  f2  is  the  annular  region 


a = {(x,  y)  = (r  cos  0,  r sin  0)  | rQ  < r < ^ } . 


If  we  take 


- If)  S- 


ri  = {(X,  y)|  r = rx },  TQ  - {(x,  y)  I r = rQ  } , (8.26) 

then  the  pair  ft,  is  star-complemented  (ft  - !(x,  y)  | r<  rQ}  . On 

the  other  hand,  if  we  take 

T1  = {(x,  y)  |r  = rQ  },  rQ  = { (x,  y)  I r = ^ } (8.27) 

then  ft,  Tj  is  not  star-complemented. 

Following  the  duality  approach  used  repeatedly  in  earlier  parts  of 
this  paper  controllability  of  (8.  24).  (8.  25)  is  related  to  the  observability 
of 


dz 

Ht 


- ^-4  = 0,  0<t<r,  (x,  y)  € ft  , 

9x  9y 


(8.  28) 


z(x,  y,  t)  = 0, 


(x,y)  « rQ,  z(x,y,t)  = 0,  (x,  y)  € r 


(8.  29) 


cu(x,  y,  t)  = — (x,  y,  t)  (h  — - (r,  0)),  (x,  y)  e r,  (r  = r 


3r 


1 'or  ro}) 


(8.  30) 


We  will  consider  first  the  case  (8.  26)  where  ft,  r is  star-complemented. 
Here  our  earlier  controllability  results  apply  to  give  controllability  of 
(8.24),  (8.25)  for  arbitrary  t > 0 . To  express  this  in  terms  of  bounds 
on  biorthogonal  functions  we  note  that  the  eigenfunctions  of  the 
Laplacian  in  ft  with  homogeneous  Dirichlet  boundary  data  have  the 
form 


*M(r’  0)  = e 


ikO 


Jk.  ! 


(r), 


0,  1,2,  3,  . . .,  t = 1,  2,  3,  . . . 


where  the  z are  the  solutions  of  the 
k.  I 


for  Bessel's  equation. 


following  eigenvalue  problem 


_ 1 6 6 - 


-f 


(8.  31) 


lii  + i dAx 

,2  r dr 
dr 


(Xk,f  -->ZM  ' °’  ro<r<ri 

’ r 


2k.f(ro)  = zk,*(ri)  = 0' 


(8.  32) 


Assuming  (8.  31),  (8.  32)  has  been  solved  to  obtain  the  { and  f 
a solution  of  (8.  28),  (8.  29)  has  the  form 


z(x,  y,  t)  = z(r,  0,  t)  = 


00  oc  - \ t 

- z z t e ,f  ({r’0) 

k=0  1=1  k,t 


00  00  - X t 

V V ^ k,  l ik  0 

= ' > 4,  „ e e z,  Ar ) ■ 

—>  — < k.  f k,  t 

k=0  1=1  ’ ’ 


The  observation  (8.  30)  is  (continuing  to  assume  (8.26)) 


00  oc 


v1  v>  „ " ( t ikO  6zk,  t 

to(x,  y,  t)  = u(  0,  t)  = C,  e e 

k = 0 <=1 


Or  (ri}  • 


s Z Z &k  t pk  ■ 

k=0  (=1  ’ ’ 


(8.  33) 


The  foregoing  theory  of  controllability  for  star-complemented  configurations 

shows  that  the  set  of  functions  {p^  ^ } possesses  a biorthogonal  set  in 

lZ(r  * [0,  T ]),  namely  the  {q  } for  which  we  have  bounds  (cf.  (8.2  3)) 
1 rC,  * 


; K0  exp(Ki  wk,  t 


(8.  34) 


’ l (rx  x[o,  T ] ) 

(Note  that  the  indices  k,  f are  used  differently  in  this  example  than  they 
are  in  the  analysis  leading  up  to  (8.  2 3).)  The  result  of  Theorem  7 . 3 
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r 


depends  on  the  term  co  = \ \ appearing  in  the  argument  of  the 

k,  f k,  f 

exponential  in  (8.2  3),  (8.  34). 

If  we  now  pass  to  the  case  (8.  27)  the  only  change  is  that  (8.  33) 
becomes 


oc  x - X.  t , , 9z 

~ va  v k,  ! ike  k.  ( . 

(x»  Y,  t)  = w (0,  t)  = _j  N ^ , e e (r0) 

k = 0l=l  ’ 


* * 9zk  f ,d\t  ~\e  ike  9z 

= V !1  ^k,f(-#-(V/-ef-(r  i))e 

k=0  f=l  ’ 


— ^ (r  ) 
9r  ' l' 


9zk  t 9zk  f 

k=0  *=1  ’ 


oc  or 

_v  V 


oc  oc 

'v  VAy01'1 


k=0  f=l 


The  set  of  functions  (Pk  f)  clearly  has  in  L { r{  X [0,  t])  the  biorthogonal 


set  {q  }: 


r^  9z, 
*0 


«k , =r-|-^(ri,/-V<ro"i,k,( 


(8.  35) 


To  obtain  an  estimate  of  the  form  (8.  34)  for  the  q ^ ( 


we  need  bounds 


on 


9zk  f 8Zk  l 


dr  ' l"  9r  ' 0' 


Such  an  estimate  has  been  obtained  in  f 102 ].  Multiplying  (w.  31)  by 


r and  then  putting  s = log  r,  (8.  31)  becomes 


dZz 


ds 


¥ ♦ <•**  v , - “Xi  ■ " 


(8.  36) 


log  r0  < s < log  r^  . 
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d z 2 

Positivity  of  the  operator  - — - 4-  k z implies  we  must  have,  for  all 

ds 

k,  I , 


2 , 2 
r,  X , > k 

1 k,  t — 


(8.  37) 


We  may  assume  therefore  that  (8.  36)  takes  the  form 


d2z 


ds 


log  rQ  < s < log  r^ 

where  f (s)  is  bounded,  uniformly  in  k,  t and  s,  and  monotone 
k,  l 

(increasing)  as  a function  of  s for  each  k,  f . Using  the  differential 
equation  in  this  form,  a rather  nice  argument  in  [102]  establishes  that 


i 9z 


J£rl(r 

Or  ' 1 


3z. 


<ro» 


- Koexp(Kl“ k.P 


with  Kq,  Kj  depending  on  the  bound  satisfied  by  |f(s)|,  and  . 
From  this  and  (8.  34)  it  is  clear  that  the  q do  satisfy  an  inequality 
again  of  the  form  (8.  34)  and  one  obtains  the  result  of  Theorem  7.  3 for 
the  case  (8.  27)  via  the  route  (8.  10),  (8.  11)  again  - this  time  without 
any  condition  of  star-complementation. 

No  complete  characterization  of  settings  for  which  the  results  of 
Theorem  7.  3 (restated  for  (8.  1),  (8.  2),  (8.  3))  obtain  is  available  at  this 
writing. 
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9.  RELATED  TOPICS 


In  this  concluding  section  we  will  consider  three  topics  which  are 
not,  in  the  strict  sense,  included  in  the  area  of  inquiry  announced  in 
the  title  of  this  article.  They  are;  controllability  of  nonlinear  systems; 
the  linear-quadratic  regulator  theory  for  linear  distributed  parameter 
systems;  and  the  time  optimal  control  problem  for  Linear  distributed 
parameter  systems.  They  are  legitimate  subsidiary  topics  for  this  sort 
of  review  paper  because  developments  in  these  areas  are  essentially 
dependent  upon  the  existence  of  an  adequate  controllability  and  stabil- 
izability  theory  for  related  linear  systems.  We  can  give  only  the  briefest 
idea  of  the  nature  of  existing  and  developing  results  because  those  re- 
sults are,  for  the  most  part,  incomplete  and  fragmented  and  because  of 
space  limitations.  There  are,  of  course,  other  topics,  e.  g.  , the  repre- 
sentation theories  of  Baras  and  Brockett  ([3],  [4]),  Helton  ([35],  [36]) 
and  others  which  are  also  related  to  controllability.  We  do  not  attempt 
to  treat  these  in  the  present  article. 

9a.  Controllability  of  Nonlinear  Systems 

We  have  about  fifteen  years  of  history  for  controllability  of  a non- 
linear finite  dimensional  system 

x = f(x,  u),  X « E^,  u e E™  . (9.  1) 

Supposing  that  f = Em+n  -*•  E*1  is  of  class  C with  f(0,  0)  = 0,  one 
seeks  to  investigate  controllability  of  (9.1)  for  (x,  u)  near  (0,0)  under 
the  basic  assumption  of  controllability  for  the  linearized  system 
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1 


x = Ax  + Bu,  A = .—  (0,0),  B —(0,0)  . (9-2) 

’ dx  du 

The  earliest  definitive  result  ippe  r.  t he  th<  paper  [66]  of  Lee  and 
Markus  appearing  in  1961.  A ' n - •/.*.  ■'  streamlined"  version  appears 
in  their  book  [56],  We  will  n '.h-  n ■.  pr  i h,  based  on  the  implicit 
function  theorem,  but  will  j h it  • tting  (developed  in  [47], 

[61])  which  we  have  already  intr  u r tion  2. 

Consider  the  problem  >f  "re a liability".  We  let  x(0)  = 0,  for 
T > 0,  and  consider  those  state  : x ^ which  an  be  reached  at  time  T 
with  ue  LZ([0,  T];  E™)  . We  have 

T 

x = x(T)  = f eA(  * ' B u(t)dt  . (9.  3) 

1 0 

We  consider  a control  u depending  on  an  n-dimensional  parameter 
vector  | : 

u(£,  t)  = B*  eA  (T-t)  £ = U(t)  e • (9-4) 

Substituting  (9.4)  in  (9.  3)  and  solving  for  £ we  have 

£ = Z(T)'1  xL  , 

where  Z(T)  is  the  matrix  in  (2.  10),  whose  inverse  exists  just  in  case 
(9.2)  is  controllable.  This  establishes  a nonsingular  linear  transformation 

xL  = x^)  = Z(T)£ 

between  the  state  x and  the  parameter  £ determining  the  control 
u = u(£,  t)  . 
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Writing  (9.  1)  in  the  form 


x = Ax  + Bu  + g(x,  u)  , 


with  g(x,  u)  = f(x,  u)  - Ax  - Bu,  we  have 


x , = f eA(r't)(Bu(t)  + g(x(t),  u(t)))dt  . 
0 


Let  us  again  write  u = u(£,  t)  in  the  form  (9.4).  We  then  have 


x - / eA(r_t)(BU(t)e  + g(x(£,t),  U(t)£)dt 


= m)  , 


with  x(£,  t)  the  solution  of 


9x(|,t) 


= f(x(£,t),  U(t)£),  x(£,t)  = 0 


Following  [55]  and  [61]  we  compute 


= f eA(T_t)  BU(t)dt 


+ / eA(T't)[g(x(e,t),  U(t)g)  ||-(g,t)  +|f(x(g,t),  U(t)g)  U(t)]dt 


When  £ = 0 we  clearly  have  x(£,t)  = 0,  U(t)£  = 0 and,  since 

9g  9g  , , 

^ (0,  0)  = 0,  ~ (0,  0)  = 0,  we  find  that 


9F  r 
3 = 


eA(T-t)  B U(t)dt  = (cf.  (9.4)) 


f eA(r't)  BB*  eA  (T_t)dt  = Z(T)  , 
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i.  e.  , the  Jacobian  of  the  map  (9.  5)  at  £ = 0 is  the  map  Z(T)  associated 


r 

with  the  linearized  system  (9.  2).  The  implicit  function  theorem  shows 
immediately  that  (9.  9)  is  solvable,  yielding 

e = 

for  Xj  in  some  neighborhood  of  0,  showing  that  (9.2.)  remains  control- 
lable in  the  sense  that  it  can  be  steered  from  the  origin  to  any  sufficiently 
small  state  Xj  during  [0,  T]  . 

Most  existing  results  on  controllability  of  nonlinear  distributed 
systems  follow  this  same  pattern.  A control  to  state  (or  control  parameter 
to  state)  map  is  constructed  for  the  nonlinear  system  and  its  Jacobian 
is  shown  to  be  the  comparable  map  for  the  linearized  system. 

One  exception  to  this  rule  is  the  paper  [9]  of  M.  Cirina.  In  it 
he  considers  a quasilinear  generalization  of  the  hyperbolic  system  studied 
here  in  Section  3,  viz. 

9w  9w 

— = A(x,  t,  w)  — +f(x,t,w)  . (9.6) 

In  his  work  controls  are  applied  at  x = 0 and  x = 1 via  (cf.  (3.11),  (3.  12), 
(3.13)) 

w"(0,  t)  = u(t),  w+(l,  t)  = v(t)  (9.7) 

or  else  just  at  one  end; 

w (0,  t)  = 0,  w+(l,  t)  = v(t)  . (9.  8) 

In  either  case  Cirina  considers  the  "null-controllability"  problem 
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w(x,  0)  = 4>(x),  w(x,  T)  = 0 


since  (9.6),  (9.7),  (9.8)  are  not  time  reversible.  However  one  may  modify 
the  method  to  allow  w(x,  0),  w(x,  T)  to  both  be  given  arbitrarily  (within 
an  appropriate  class)  in  the  case  of  time  reversible  systems.  Theorem 
3.2,  with  certain  modifications  so  that  the  Goursat  problem  is  replace  .' 
by  a standard  initial -boundary  value  problem  for  the  equation  obtained 
from  (9.6)  by  reversing  the  roles  of  x and  t : 


3w 

— = A(x,t,w) 


- f(x,t,w)] 


Exact  controllability  results  comparable  to  those  of  Theorem  3.2  are 
obtained  constructively,  for  initial  states  q>  e E!*([0,  1];  Em)  with 
sufficiently  small  norm. 

Cirina's  result  is  not  based  on  the  implicit  function  theorem  - 
and  probably  could  not  be.  A fair  description  of  his  method  would  seem 
to  be  this.  Theorem  3.  2 provides  a method  whereby  one  can  construct 
a map  from  w(- , 0)  to  a control  u (or  control  pair  u,  v)  steering 
w(- , 0)  to  zero  at  time  T . (Theorem  3.  2 can  be  modified  to  treat 
controls  applied  at  x = 0 and  x = 1;  see  [86].)  Cirina's  contribution 
consists  in  his  establishing  that  this  mapping  procedure  can  be  carried 
over  to  the  quasilinear  systems  (9.  6).  The  method  is  more  direct  than 
the  implicit  function  method  which  first  constructs  the  nonlinear  map 
F (c f.  (9.  5))  and  then  infers  the  existence  of  a nonlinear  F ^ based 
on  properties  of  the  Jacobian  of  F . The  method  is  heavily  dependent 
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upon  the  fact  that  the  roles  played  by  x,  t in  hyperbolic  systems  in 
two  variables  are  interchangeable  and  thus  appears  not  to  be  extendable 
to  other  types  of  systems. 

The  first  use  of  implicit  function  type  methods  to  establish  con- 
trollability of  nonlinear  distributed  parameter  systems  appears  to  be 
:ue  to  H.  O.  Fattorini  [24].  He  considers  a nonlinear  wave  equation 


9 w 9 , . 9w 

p(x)  — 2 = ~fa  (a(x)  ‘ F(x>  w>  + 9(x)  u(t)  > 

9t 


0<x<!,  0<t<T,  P(x)  > p0  > 0,  a(x)  > aQ  > 0 . 


The  basic  assumption  on  F is  that  it  is  of  class  o1  and  odd  as  a 

d p 

function  of  w with  -7—  (x,  0)  > 0 . The  boundary  conditions  initially 
take  the  form 


8 w Z ? 

a0  w(°,t)  t bn  -r^lO.t)  = 0,  (a0)  + <b„>  *0  , 


0 


3w  Z Z 

a!  w(i,t)  + bj  — (1,  t)  = 0 (at)  + (bp  *0, 


but  other  restrictions  are  imposed  in  the  course  of  the  work.  The  problem 

treated  is  that  of  reachability: 

3 w 9 w 

w(-,0)  = — (-,0)  = 0,  w(* , T)  = Wj,  — (-,T)  = Vl  . 


It  is  studied  with  reference  to  the  same  problem  as  posed  for  the  linearized 
equation 


. 9 w 9 9w  9F 

p(x)  — • 
ot 


(9.  9) 
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This  problem  can  be  resolved  by  the  method  presented  at  the  beginning 
of  Section  4,  involving  a moment  problem  in  the  theory  of  nonharmonic 
Fourier  series. 

The  operator 


1 r 9 9w , 9F  , , , 

L = [-, — (a(x)  — + (x  • w] 

p(x)  1 c)x  ' ' ’ ctx  dw 


with  the  indicated  boundary  conditions  is  unbounded,  positive,  self- 

adjoint  in  L“[0,  1],  the  eigenfunctions  o forming  an  orthonormal  basis 
P K 

for  [0,1]  . (Recall  the  inner  product  in  [0,1]  is  (w,  w)  = 
j P P P 

f w(x)  w(x)  p(x)dx  .)  Expanding  g: 

0 


or 


gk  \ 


one  requires 

^ k = 1,2,3,... 

lim  inf  kgt  - g > 0 . 
k— oc 

lim  sup  kg  = g > 0 . 
k-*  » 

Under  these  assumptio  is  it  is  shown  in  [24]  that  all  states  (w^,  v^)  c HxK 
are  reachable,  where 

H = (w  € Dom(L)  } 

* 

1 

K = {w  « Dom(L  ) } 


equipped  with  the  inner  products 


(w,  w)  = (Lw,  Lw)  , 

H p 

_ 1 _ 

(V,  v)K  = (L ! v,  L2  w)  , 

provided  T _>  T^,  the  minimal  control  time  discussed  in  earlier  sections. 
For  T = Tj  the  control  to  state  map 

C:  u e L2[0,  T]  -*  (w,  v)  e H X K 


is  a Hilbert  space  isomorphism. 

The  main  part  of  the  work  consists  in  establishing  that  solutions 

of  the  nonlinear  system  (9.  9)  continue  to  lie  in  H X K and  that  the 

nonlinear  control  to  state  map  has  C,  the  control  to  state  map  for  the 

linearized  system,  as  its  Jacobian.  Once  this  has  been  done  familiar 

extensions  of  the  implicit  function  theorem  to  infinite  dimensional  spaces 

(see  e.  g.  [13])  apply  to  show  that  the  nonlinear  control  to  state  map 

2 

carries  a small  neighborhood  of  0 in  L [0,  T]  onto  a small  neighborhood 
of  (0,  0)  in  H X K - thus  establishing  local  reachability  for  the  non- 
linear system. 

This  approach  was  carried  further  by  the  late  W.  C.  Chewning  [8] 
who  applied  it  to  a higher  dimensional  nonlinear  hyperbolic  equation 
with  boundary  value  control  — the  linearized  version  of  which  is  studied 
in  Section  5 of  this  revievr  article.  The  basic  approach  is  the  same  as 
Fattorini's  but  there  are  significant  differences  in  the  assumptions  which 
have  to  be  made  on  the  nonlinearities  involved.  Chewning' s system 
takes  the  form 
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where  £2  is  a domain  of  the  type  discussed  in  Section  5.  (Actually  £2 
is  taken  to  be  a higher  dimensional  rectangle  or  " parallelopipedon"  in 
[8]  but  this  is  unnecessary.  ) The  control  takes  the  form 


w(x,  t)  = u(x,  t),  x € 3 £2,  t € [0,  T] 

for  some  T > 0 . The  analysis  is  carried  out  in  the  space  v""(£2)  (cf. 

2 1 

(5.28))  and  the  principal  assumptions  on  f:  V (£2)  — H (£2)  (which  may 

or  may  not  take  the  form  f(w(x,  t),  ^-(x,  t) ) ) are  (i)  f is  continuous  and 

continuously  differentiable  in  a neighborhood  of  (0,0)  in  y2(£2); 

(ii)  f(0,  0)  = 0;  (iii)  II  f(w,  v)  - f(w,  v)  ||  < ||  (w,  v)  - (w,  v)||  • C(||(w,  v)||  , 

II  (w,  v)  ||);  and  (iv)  the  Frechel  derivative  of  f is  locally  Lipschitz  near 

the  origin.  If  f has  the  form  f(w(x,  t),  ^(x  t))  all  those  conditions 

are  satisfied  if  f has  Lipschitz  continuous  partial  derivatives  of  order 

£ 2 . These  assumptions  permit  one  to  again  establish  that  the  control 

to  state  map  for  the  nonlinear  system  has  the  control  to  state  map  of  the 

2 

linearized  system  as  its  Jacobian.  That  the  latter  maps  onto  y (£2)  is 

established  as  indicated  in  Section  5.  The  implicit  function  theorem 

again  shows  that  the  map  for  the  nonlinear  system  has  range  which 

2 

includes  a neighborhood  of  the  origin  in  V (£2)  . 

Concerning  exact  controllability  of  nonlinear  parabolic  systems 
very  little,  if  anything,  has  been  reported.  The  implicit  function 
approach  appears  to  be  unusable  because  it  is  very  difficult  to  identify 
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a topology  for  the  state  space  so  that  the  control  to  state  map  for  the 
linearized  system  is  continuous  and  onto  while,  at  the  same  time,  state 
perturbations  due  to  nonlinerities  continue  to  lie  in  this  space.  This 
quest  is  made  doubly  difficult  by  the  fact,  noted  in  Sections  7,  8 that 
we  have  no  adequate  characterization  of  the  set  of  reachable  states  for 
parabolic  control  processes. 

In  the  case  of  approximate  controllability  of  nonlinear  parabolic 
equations  we  are  more  fortunate.  In  a recent  paper,  [37].  M.  Henry 
has  introduced  a number  of  techniques  which  he  shows  to  be  useful  in 
such  studies  and  which  appear  to  hold  considerable  promise  for  wider 
application.  The  technique  is  basically  one  of  "cancellation",  the 
control  being  used  to  steer  the  linearized  system  and,  at  the  same  time, 
"cancel"  the  state  perturbation  due  to  the  nonlinearity  present  in  the 
system  - at  least  approximately.  Various  approximate  controllability 
theorems  are  given,  corresponding  to  a number  of  different  control 
situations.  We  will  provide  the  barest  outline  of  one  of  them  in  a setting 
somewhat  oversimplified  as  compared  with  what  Henry  actually  employs. 

Consider  then  the  equation  of  Section  7,  modified  now  to 

p(x)  ~ - V — ~r  (a!(x)  ) + f(w(-  , t))(x)  = 0,  x e S2,  t > 0 , (9.  10) 

9t  ij  = l 3X1  J 3x] 

with  boundary  conditions 

f°  x e r 

(-rjf  (x,  t),  A(x)  v(x)  =/  (9.  ID 

E \ju(x,  t),  x c r{ 
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and  initial  state 


w(x,  0)  h 0 . 

2 2 

The  assumption  on  f:  L ( S2)  — L (12)  is  that  it  should  be  continuous  and 

P P 

bounded.  Approximate  controllability  of  the  system  with  f = 0 for 
2 

controls  u e L“(  Tj  X [0,  t ]),  t > 0,  follows  from  Fattorini's  result 

which  we  have  summarized  here  in  Theorem  8.  1.  Thus,  given  a desired 

state  w e L^(12),  and  a "tolerance"  e > 0,  we  can  find  a control 
1 P 

u = u such  that 
e 


I v(-  , t)  - w ||  < e , 

L (12) 

P 


(9.  12) 


v satisfying  (9.10),  (9.11)  (with  w replaced  by  v,  f replaced  by  zero). 

2 2 

Now  for  z € L ([0,  t ];  L (12))  consider  the  linear  nonhomogeneous 

P 

equation 


By  v1  9 i,  By 

"Waf-.E  , -i<aj,x)7) 

i,j  = l 3x  Bx 


-j  (a:(x)  ) f f(z(-  , t))(x)  = 0 (9.13) 

c 1 dx 

A ^ 

with  the  same  boundary  conditions  as  above.  Writing  y = y + y where 

ys  n A 

By  v 3 i By 
P<x)-T^-  - ^ — r (a  (X)  ) + f(z(-  , t))(x)  = 0 

i,J  = l Bx  J BxJ 

A 

y (• , °)  = °,  ^ (x,  t),  A(x)  v(x))  n = o,  X6  r,  t>o  , 


wg  see  that  we  can  achieve 


II  y(*  , T)  - Wj  l|  , < e (9.  14) 

L^fi) 

P 


by  applying  (9.12)  with  v(- , t)  replaced  by  y(-,T)  and  w^  replaced 

A 

by  w - y(*  , t)  (we  are  neglecting  certain  technical  refinements  here; 

' s 

see  [37]  Theoreme  3).  Supposing  (9.14)  to  be  achieved  with  a control 

u with  ||u||  < R,  Henry  defines  a map  q (z)  = {y  = y(x,  t)  | y 

L (T  x[0,  r] 

satisfies  (9.  13)  ana  boundary  conditions  of  the  form  (9.  11)  with 

II  u ||  , <R  and  y(- , t)  satisfies  (9.  14) } . Actually  ct  is 

L (Tj  X[0, t ] e 

a multi-valued  map  because  we  do  not  know  that  u is  unique.  For  the 

2 2 

purposes  of  this  expository  article  we  may  think  of  c?  : L ([0,  t],  l (ft)) 

r o 


H 


2 2 

L ([0,  t];  L (f2))  but  Henry  uses  a more  complicated  space,  namely 
i/2’  J/4(0  X [0,  T ],  . 


A fixed  point  of  the  multivalued  map  Q is  a point  y such  that 

e 

y « (y)  • It  is  clear  that  such  a fixed  point  is  a solution  of  the  non- 

e 

linear  system  (9.  10),  (9.  11)  and  satisfies  ||y(-  , t)  - w II  < e . In 

I>) 

[37]  this  fixed  point  is  shown  to  exist  by  establishing  that  _7  is  upper 
semi-continuous  in  an  appropriate  sense,  after  which  application  of  the 
Kakutani -Tychonoff  theorem  [5]  gives  the  desired  result. 

The  whole  field  of  nonlinear  controllability  is  very  much  in  its 
infancy  and  promises  to  provide  grist  for  many  a mathematical  mill  for 
some  time  to  come. 
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9(b)  Linear-Quadratic  Regulator  Theory 


J 


We  have  noted  in  Section  1 the  great  impact  which  the  linear- 
quadratic  optimal  control  theory  originating  with  Kalman  and  Bucy  [47], 
[48]  has  had  on  modern  control  systems  design.  Because  the  theory  is 
heavily  dependent  on  stabilizability  and  hence,  in  most  cases,  on  con- 
trollability, it  is  appropriate  that  we  should  give  some  attention  to  it 
here.  Our  treatment  is  necessarily  brief  and  incomplete. 

We  will  not  discuss  the  theory  as  it  applies  to  parabolic  equations 
because  that  area  has  been  so  fully  and  so  successfully  covered  by 
J.  L.  Lions  in  his  landmark  book  [57]:  paraphrasing  Shakespeare 

we  might  say  that  we  "honor  his  contributions  in  exclusion"  from  our 
less  complete  discussion.  Lions  has  treated  hyperbolic  problems  as 
well  but  the  discussion  in  that  case  is  necessarily  less  complete  because 
the  required  controllability  results  were  not  available  when  the  book  was 
written. 

The  most  complete  controllability  results  for  hyperbolic  equations 
are  those  for  the  linear  symmetric  hyperbolic  system  in  two  independent 
variables  discussed  in  Sections  3 and  4.  It  is  for  that  system  that  we 
shall  pose  and  discuss  the  linear-quadratic  problem  here.  The  treatment 
follows  the  general  lines  presented  in  [91]. 

We  consider  then  the  system 

9w  9w 

aT  = A(X)  *9x  + B(x)w  ’ (9.15) 


9 
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(9. 16) 


■■ 


w'(0,  t)  = D0  w + (0,  t)  , 
wf(l,  t)  = Dj  w (1,  t)  4-  Du(t)  , (9.17) 

all  matrices  appearing  here  and  all  notational  conventions  to  be  used 
hereafter  being  those  of  Section  3 with  the  same  hypotheses  as  set  forth 
there. 

Let  T > 0 . Corresponding  to  an  initial  state 

w(*  , 0)  = wQ  e L2([0,  1];  En)  (9. 18) 

2 

and  control  u e L [0,ao);  we  define  a quadratic  cost  functional 
T 1 1 

J(u,  w T)  = f [f  f (w(x,t),  W(x,  £)  w(£,  t))  n dxd£ 

U 0 0 0 E 

+ (u(t),  Uu(t))  ldt  , (9.19) 

Eq 

wherein  we  assume  the  q xq  matrix  U is  symmetric  and  positive 
definite  while  the  continuous  n X n matrix  function  W(x,  £)  satisfies 
(*  denoting  adjoint) 

W(x,  i)  = W(£,  x)"  , 0 < x < 1,  0 < e < 1 

and 

11  2 n 

f f (w(x),  W(x,  ^)  w(^))  dxd^  >0,  w e L ([0,1];  E ) . 
j j ri 

0 0 E 

In  (9.19)  it  is,  of  course,  assumed  that  w and  u together  satisfy  (9.16)- 
(9.18). 

A /A 

The  conditions  necessary  and  sufficient  in  order  that  a pair  w,  u 
should  (uniquely)  minimize  (9.  19)  are  as  follows.  Let  v = v(x,  t)  solve 


-183- 


MM 


the  adjoint  equation  (cf.  (3.2  3),  modified  with  a nonhomogeneous  term) 

9v  9v  * ^ 

liT  " A(X)  frx  ' (B  (X)  ' A'(x)v  “ f w<x>£)  w(M)d4  (9.20) 

" 0 

with  boundary  conditions  of  the  form  (3.  24),  (3.  25)  and 

v(*  , T)  = 0 . (9.  21) 

Then  (cf.  (3.4),  (3.5)  for  +,  - notation) 

u(t)  = -U-1  DT  A+<1)  v + (l,t),  te[0,T]  . (9.22) 

The  equations  (9.15),  (9.20)  with  initial  and  terminal  conditions 
(9.  18),  (9.  21),  respectively,  together  with  the  noted  boundary  conditions, 
are  coupled  by  (9.  22)  and  hence  constitute  a non-trivial  two  point 
boundary  value  problem  for  the  partial  differential  equations  involved. 
This  is  a very  complicated  mathematical  entity;  if  we  were  forced  to  work 
within  this  framework  the  results  which  we  could  establish  would  be 
severely  limited.  Fortunately  we  have  the  earlier  works  cited  above 
which  provide  the  critical  notions  needed  to  make  further  progress.  The 
most  important  idea,  which  comes  from  Kalman's  solution  [47]  of  the 
Hamilton- Jacobi  equation  associated  with  this  optimization  problem  is 
the  possibility  of  expressing  the  adjoint  solution  v as  a linear  function 

A 

of  the  optimal  system  state  w: 

1 

v(x,t)  = f Q(x,  £,  t)  w (£,  t)d£  . (9.23) 

0 

Substituting  (9.  2 3)  into  (9.  20)  and  using  (9.  15)  one  obtains  for  the  matrix 
0 the  partial  differential  equation 
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90  9Q  3Q 

= A(x>  air  + H A(^>  + P(Q) 


(P(Q(-,  • , t)))(x,  i)  - -(B(x)  - A' (x))  Q(x  e,t) 


- Q(x,£,t)(B(£))  - W(x,£) 

+ Q (x,  1,  t)  A+(l)  DU'1  D V(l)  Q + (l,  t,  t) 


The  terminal  condition  is 


Q(x,e,T)  = o 


and  0 satisfies  boundary  conditions  (cf.  (3.24),  (3.  25)) 

o+(o,e,t)  = -(a+(0))_1  dJ  a"(0)  Q"(o,e,t)  , 

0f(x,  0,t)  = -0_(X,  0,  t)  A'(0)  D0(A'(0))'1  , 

Q'd,e,t)  = -(A'(i))'1  d[a+(d  ofd,e,t)  , 

Q (x,  1,  t)  = -Q+(x,l,t)  A+(l)  D^A'fl))'1  . 


Assuming  the  system  (9-24)  - (9-  30)  can  be  solved  for  Q,  the  formula 
(9.22)  relating  the  optimal  control  to  the  "adjoint"  solution  v becomes 
a linear  feedback  control  law 

u(t)  = -U"1  DT  A+(l)  / Qf(l,£,t)  w(£,t)d£  . (9.31 

0 

The  system  (9.  24) -(9.  30)  has  a very  complicated  appearance, 
(9.24),  (9.25)  are  nonlinear  with  the  nonlinearities  involving  boundary 
values  of  the  solution.  Moreover,  (9.24)  is  a linear  hyperbolic  system 
in  three  independent  variables  - ordinarily  far  more  complicated  than 
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hyperbolic  systems  in  two  variables.  It  turns  out,  however,  that  the 


fact  that  A(x),  A(|)  are  simultaneously  diagonal  matrices  permits 
definition  and  use  of  characteristic  curves  (as  opposed  to  characteristic 
surfaces)  in  much  the  same  way  as  one  does  for  two  independent  variables. 
Moreover,  despite  its  forbidding  appearance,  (9.24)  is  semilinear  ([11]). 

As  a consequence,  the  local  existence  and  uniqueness  theory  for  (9.24)- 
(9.  30)  for  t near  T is  very  standard,  requiring  no  tools  more  compli- 
cated than  those  already  existing  in  Chap.  V of  [11]. 

Complete  solution  of  the  optimization  problem  requires  that  the 
solution  Q(x,  £,  t)  be  continued  over  the  whole  interval  0 £ t < T . 

For  this  an  a priori  bound  is  required  and  it  is  provided  through  the 
observation  that  if  the  problem  of  minimizing  (9.  19)  is  posed  on  any 
interval  [t,T]  , 0 < t < T,  with  initial  state 

w(*  , T)  = WT  € L2([  0,  1]  ; En)  , 

then  the  minimal  cost  turns  out  to  be 

1 1 

f / (wT(xh  Q(x,  t>  T)  wt(£))  n dx  d£  . 

0 0 E 

This  cost,  by  virtue  of  its  minimality,  is  less  than  or  equal  to  that 

realized  with  use  of  a fixed  linear  feedback  relation 

1 

u (t)  s f K(x)  w(x,  t)dx  (9.32) 

K 0 

for  which  the  cost  is  seen  to  be 
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1 1 

f f (w  (X),  Q (X,  i,  T)  W (£))  dx  df  , 

0 0 T K ' En 

wherein  (3  satisfies  a system  of  the  form  (9.  24) -(9.  30)  but  with  P(Q) 
replaced  by  P (Q  ) : 

i\  K. 

(PK(°KC,  - ,t)))  (x,  e)  - -W(x,£)  - K(x):  UK(£) 

-K(x)  D1  A+(l)  Qk(1,  £,  t)  - Qk(x,  1,  t)  A+(l)  D K(£)  . 

The  system  satisfied  by  Q is  thus  linear  and  solutions  may  be  obtained 

K 

on  arbitrary  t-intervals.  The  a priori  bound  for  Q is  then 
1 1 

f f (w(x),  (Q  (x,e,x)  - Q(X,e,T))  w je))dxdg  >0  (9.33) 

0 0 K 

and,  properly  exploited  (see  [91  ]),  this  enables  Q to  be  extended  to 
f0,  T]. 

Of  primary  interest  in  this  analysis  is  what  happens  as  T -*■  . 

If  one  can  show  that  there  is  some  Q(x,  £)  such  that 
1 1 

/ / (w  (x),  (Q(X,e)  - o (x,  e,  t))W  (in  dxdt  > o (9.34) 

0 0 E 

Z n 

for  all  w^e  L ([0,1];  E ),  and  for  re  [0,  T ] with  T arbitrarily  large, 
then  via  (9.  33)  we  obtain  a global  bound  for  0 on  arbitrary  intervals  as 

well.  Combined  with  the  fact  that  Q is  monotone;  if  < r ? 

1 1 

/ / (w(x)  (Q(x  £,  t ) - Q(x,  £,  t ))  w(£))  dxd£  > 0 , 

0 0 1 2 En 

and  the  fact  that  the  system  (9.  24) -(9.  30)  is  autonomous,  such  a bound 
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allows  one  to  see  that  as  T — ^ one  obtains  a constant  form  linear 


feedback  control  law 

u(t)  = -U’1  D1  Af(l)  f Q*(l,£)  w(£,t)d£  (9.35) 

0 

which  synthesizes  the  optimal  control  for  the  problem  with  cost  functional 
(cf.  (9.19)) 

°e  1 1 

J(u,  w ) = f [f  f (w(x,  t),  W(x,  £)  w($,  t))  dxd| 

0 0 0 E 

+ (u(t),  U,  u(t))  ldt  . (9.  36) 

Eq 

The  matrix  Q satisfies  (cf.  (9.24))  the  time  independent  equation 

OC 

9Q  9Q 

A(x)  ~dT  + Tf  A(e)  + P(Q)  = 0 

with  boundary  conditions  again  of  the  form  (9-  27)-(9-  30). 

Now  in  order  to  establish  the  vital  uniform  inequality  (9.  34)  one 
must  establish  that  solutions  of  (9.  15),  (9.  16),  (9-  17)  with  u = u^ 
determined  by  (9.  32),  for  some  appropriate  K,  decay  appropriately 
fast  as  t -*  * . in  general  this  part  of  the  theory  remains  incomplete. 

The  only  case  where  it  can  be  said  to  be  completely  solved  is  for  the 
system  (4.  21),  (4.  22), ( 4.23)  of  Section  4 - and  even  there  the  situation 

is  complex  enough. 

Suppose  the  boundary  conditions  (4.22),  (4.2  3)  are  such  (see 
Assumption  3.  8 of  Section  3)  so  that  a in  (4.  9)  has  negative  real  part. 
Then  the  eigenvalues  or^_  of  the  operator  (4.  8)  asymptotically  lie  on 

-188- 


the  line  Re(z)  = Re(n)  < 0 . Feedback  of  the  form  (1.  SO),  which  agrees 


with  (9.  32),  may  then  be  used  to  ensure  uniform  exponential  decay  of 
solutions  of  the  closed  loop  system.  Once  we  have  this  we  assure  the 
cost  (9.  19)  with  T - ao  is  finite  for  each  w^  « L^([0,  1]  ; En)  when  the 
control  determined  by  (4.  50),  (9.  32)  with  k = K chosen  to  ensure 
uniform  exponential  decay  of  solution,  is  used.  Jfu  , w ) takes  the 

i\  u 

form 


I(uK»  wo! 


1 1 

= 1 f 

0 0 


(Wq(X), 


0 (x,£) 


dxd£ 


with 


0 (x,  £)  = lim 
T — 3C 


QK(x,e,o) 


(=0K  T(*,£,0)) 


Thus  our  ability  to  achieve  uniform  exponential  decay  of  solutions 
via  linear  feedback  is  critical  in  obtaining  (9.  34)  which,  in  turn  leads 
to  the  possibility  of  obtaining  the  control  u which  minimizes  (9-  19) 
with  T = x in  the  fixed-form  linear  feedback  representation  (9.  35). 

We  have  seen  in  Section  4 that  this  ability  to  determine  system  behavior 
via  feedback  is  dependent  on  our  having  fully  analyzed  the  controllability 
of  the  system  beforehand. 

It  is  also  possible  to  carry  out  this  program  for  the  general  system 
(9.  IS),  (9.  16),  (9.  17)  if  Assumption  3.  8 is  valid  (rather  than  just  (3.  64) 
and  (3.  65)  of  Assumption  3.  8 which  assure  Re(  a)  < 0 in  the  case  of  (4.  1), 
(4.2)).  But  when  Assumption  3.8  is  satisfied  the  system  (9.15),  (9.16), 

(9.  17)  is  asymptotically  stable  with  u(t)  s o and  the  situation  is  much 
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less  interesting.  What  we  need  are  results  comparable  to  those  of 
Section  4 for  the  general  system  (9.  15),  (9.16),  (9.17).  This  remains 
an  open  problem  at  present. 

If  we  begin  with  a system  (4.  1),  (4.2),  wherein  Re(cr)  > 0 (or, 
more  generally,  a system  (9.15),  (9.16),  (9.  17 ) wherein  the  conditions 
(3.64),  ( 3.6  5)  are  not  satisfied)  we  cannot  expect  to  achieve  uniform 
exponential  decay  of  solutions  of  the  closed  loop  system  using  distributed 
feedback,  (4.  50)  or  (9.  32).  In  this  situation  the  relevant  theory  requires 
modification  of  the  cost  functional  (9.  19)  so  that  it  takes  the  form 

T 1 

J(u,  w,  T)  = f f ( w(x,  t),  W(x)  w(x,  t))  dx 
0 0 E 

+ (u(t),  U u( t) ) Jdt  . (9.  37) 

Eq 

Effectively,  the  weighting  matrix  function  W(x,  £),  previously  defined 

in  the  square  0 < x < 1,  0 < £,  < 1,  is  replaced  by  a distribution  with 

support  on  the  diagonal  line  segment  x = £,  0 < x < 1 . The  solution 

Q of  the  Riccati  partial  differential  system  corresponding  to  (9- 24) -(9.  30) 

is  then  distribution  valued  also  and  the  entire  system  is  immensely 

complicated.  The  feedback  relations  considered  take  the  form 

1 

u(t)  = Kw  (1,  t)  + J K(x)  w(x,  t)dx 
0 

and,  in  the  feedback  relation  (cf.  (9.  31))  characterizing  the  optimal 
control  for  (9.  37)  K and  K are  expressed  in  terms  of  Q by  very 
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complicated  formulae.  The  theory  here  is  quite  incomplete,  particularly 
with  regard  to  the  regularity  of  the  solution  Q . The  most  complete 
extant  treatment  is  given  by  H.  L.  Koh  in  his  thesis  [50].  He  more  or 
less  completely  analyzes  systems  of  the  form  (4.1),  (4.2)  such  as  we 
have  treated  in  Section  4 and  goes  on  to  give  less  complete,  but  still 
significant,  results  for  more  general  systems.  The  complexity  of  the 
calculations  are  daunting  but  the  basic  scheme  outlined  above  for  the 
simpler  problem  is  adhered  to  and  controllability,  together  with  related 
stabilizability  (particularly  in  connection  with  Theorem  3.  9)  continues 
to  play  the  decisive  role. 

9(c)  Time  Optimal  Control 

We  begin  with  a little  background  in  the  context  of  the  finite 
dimensional  system 

x = Ax  + Bu,  x e En,  u € U , (9-  38) 

where  we  shall  suppose  that  U is  a bounded  polyhedron  in  E™  . Two 

points  x x^  € En  are  given  and  one  seeks,  if  possible,  to  steer  from 

x to  x . with  a control  u e L [0,  T]  whose  values  lie  in  U for  each 

or 

u € [0,T],  and,  most  importantly,  with  T as  small  as  possible.  A 

control  u realizing  this  objective  is  a time  optimal  control.  Assuming 

~ ~ 2 ~ 

there  is  any  finite  interval  [0,  T]  and  control  u e L [0,T]  with  values 
in  U,  one  can  establish  (see  [76],  [77],  [55])  the  existence  of  the 
optimal  time  T and  the  existence  of  an  optimal  control  u . Moreover, 
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^ A 

u is  characterized  by  the  fact  that  for  almost  all  t « [0,  T]  u(t)  solves 
the  linear  programming  problem 


min  (M(t),  Bu)  n 
U€  U E 

where  ^ is  a non-trivial  solution  of  the  adjoint  equation 

* 

= - A 


(9.  39) 


With  appropriate  "normality"  assumptions,  the  fact  that  u solves  (9.  39) 

guarantees  that  we  have 

N 

[0,  T]  = U I u(t)  = v t € Int(I  ) , 
i=l  1 1 

where  the  I.  are  closed  intervals,  overlapping  at  most  at  common 
boundary  points,  and  the  v are  vertices  of  the  polyhedron  U . This 
result  is  known  as  the  "bang-bang"  principle  [51]  since  it  implies  that 
the  time  optimal  control  u "bangs  around"  from  one  vertex  of  U to 
another  during  the  control  period.  The  simplicity  of  this  type  of  control 
action  led  to  great  interest  on  the  part  of  control  engineers  during 
(roughly)  the  period  1955-1965  but  that  interest  waned  somewhat  as  it 
was  realized  that  synthesis  of  u(t),  i.  e.,  expression  in  terms  of  the 
system  state  in  a (non-linear)  "feedback"  form,  was  almost  hopelessly 
complicated  for  all  but  the  simplest  systems  and  that  there  were  many 
practical  drawbacks  associated  with  such  abrupt  control  switching. 
Nevertheless  the  theory  has  found  significant  application  in  a number 
of  areas  and,  in  addition,  has  appealing  mathematical  simplicity  and 
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elegance.  It  is  natural  that  a number  of  attempts  have  been  made  to 


extend  the  theory  to  distributed  systems. 

By  the  time  the  reader  has  reached  this  point  in  the  paper  he  will 
not  be  surprized  to  find  that  time  optimal  control  theory  is  very  different 
for  different  classes  of  partial  differential  equations.  For  hyperbolic 
systems  there  is,  strictly  speaking,  no  true  "bang-bang"  principle  at 
all.  This  is  easy  to  see  in  the  context  of  the  system  studied  in  Section  4. 
With  initial  and  terminal  states  given  at  t = 0 and  t = 2,  respectively, 
there  is  exactly  one  control,  given  by  formula  (4.  18),  i.  e. , 

oc  c 

u(t)  = Yj  (~ ) yT  • 

f=-oe 

Now  it  can  be  shown  that  the  p are  bounded  functions.  Hence  if  the 

( 

coefficients  c^,  determined  by  the  initial  and  terminal  states,  decrease 
rapidly  enough,  the  control  u(t)  will  satisfy  any  given  a priori  bound, 
e.  g. , 

-1  < u(t)  < 1 (i.  e.  , cf.  (9.  38),  U = [-1,1]  . (9.40) 

In  general  the  time  t = 2 is  optimal  and  u(t),  given  by  (4.  18)  is  the 

2 

time  optimal  control  because  it  is  the  unique  control  in  L [0,  T]  . But 
it  is  clear  that  u(t)  is  not  ordinarily  a function  assuming  only  the  values 
± 1;  indeed,  if  the  c^  decrease  rapidly  enough,  u(t)  will  possess  any 
desired  degree  of  differentiability  in  [0,  2].  Thus  no  all-inclusive  bang- 
bang  principle  of  the  type  which  we  have  for  finite  dimensional  systems 
(9.  38)  obtains. 

-193- 


We  present  here  a bang-bang  time  optimal  control  result  for  the 


parabolic  control  system  of  Section  7,  i.  e.  , 


9w  9 w . . , . 

— - -r—  + r(x)w  = g(x)  u(t)  , 
9t  9x 


aQ  w(0,t)  + bQ^(0,t)  = 0,  alW(lft)  + b^fl.t)  = 0 . 


(9.41) 


(9.42) 


The  result  also  applies,  without  essential  modification,  to  the  boundary 
control  system  described  in  that  section. 

We  consider,  then,  the  system  (9.41),  (9.42)  with  the  added  re- 
striction 

-1  < u(t)  < 1 . (9.  43) 


For  convenience  (and  without  loss  of  generality)  we  assume  here  that  the 
initial  state  is 

w( • , 0)  = 0 (9.44) 

and  our  goal  is  to  reach,  in  minimal  time  t,  a given  terminal  state 

w(*  , t)  = Wj  . 

Disregarding  (9.4  3)  for  the  moment,  this  can  be  done,  e.g.,  if  is 

I 

a state  in  the  domain  of  the  operator  exp(A^  (-L)‘)  as  described  in 
Theorem  7.  3;  and  it  is  a genuine  curiosity  of  the  theory  that  it  is  only 
for  those  states,  which  one  has  constructively  shown  to  be  reachable, 
that  the  bang-bang  control  principle  can  be  established.  Our  approach 
is  a simpler  version  of  a more  general  result  due  to  Fattorini  [26]  and 
follows  along  the  same  lines  as  an  earlier  result  of  Egorov  [18],  who 


also  confronted  this  restriction  on  the  result.  Whether  or  not  the  result 
holds  for  all  reachable  states  appears  to  be  completely  unknown  at  this 
writing. 

The  basic  idea  is  to  impose  a particular  topology  upon  the  space 
of  reachable  states  which  we  have  identified  in  Theorem  7.3.  To  do  this 
we  again  resort  to  the  method  of  biorthogonal  functions.  In  Section  7 
we  noted  the  existence  of  biorthogonal  functions  with  (cf.  (7.  33)) 

K'z.*  < expIKi-k)  ' 

L [0,  t] 


It  turns  out  that  the  p can  be  modified  slightly  (see  [27])  to  another 

-v 

set  of  biorthogonal  functions  q^  for  the  e such  that  each  e C[0,t 


'kllc[°'Ti " t«ToT-i ~ K°e’ip,Kl  “k> 


(9.45) 


for  some  K ^ > 0 . Or,  the  same  result  can  be  achieved  ab  ovo  via  the 
Fourier  transform  method  (see  [27]  again)  which  we  employed  in  a slightly 
modified  setting  in  Section  8.  Let  us  put 


bk  = llqkyc[0,  r]  • 


(9.46) 

We  have  noted  that  elements  w t L [0,  1]  can  be  expanded  in  series 

(9.47) 


on 


W = V p.  <t>. 

k=l  k k 


where  the  4>,  are  the  eigenfunctions  of  the  Sturm-Liouville  operator 
2 k 

Lw  ^ + r(x)w  . We  likewise  have,  for  the  element  g in  (9.41)  , 

9xZ 
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g = £ gk  \ • 

k=l  K * 


Assuming  that  no  = 0,  we  define 


00  bu 

_ v L 


(9.48) 


k=l  I g, 


2 

for  those  w in  the  subspace  B C L [0,  1]  which  thereby  yield  a finite 

sum.  With  the  norm  (9.48)  B becomes  a Banach  space.  Each  w^  € B 

lies  in  the  set  of  reachable  states  for,  using  now  the  instead  of  the 

p,  , w,  is  reached  by  the  control 
k 1 

uw  = £ «r  qk(T_t) 

k=l  gk 

and  for  all  t e [0,  t] 


lu(t)l  5 i?i  \ b|c=  llw‘*B  ' 


(9.49) 


Finally  we  note  that  one  can  establish,  much  along  the  lines  of  Fattorini’s 
proof  of  time  invariance  of  the  reachable  set,  that  the  above  definition 
of  B is  actually  independent  of  the  time  t . 

We  are  now  in  a position  to  state  and  prove  the  main  result. 

Theorem  9. 1.  Suppose  a state  w « B is  reached  at  time  t>  0 from 
(9.  44)  via  (9.41),  (9.42)  with  a measurable  control  u which  satisfies 
(9.4  3)  and  suppose  this  time  t is  minimal  with  respect  to  all  measurable 
u satisfying  (9.43).  (We  remark  that  while  there  is  no  minimal  time  t 
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for  square  integrable  controls  it  can  be  shown  to  exist  for  controls 
satisfying  (9.  4 3).  See,  e.g.  [57].)  Then  |u(t)|  = 1 for  almost  all 

A 

t e [0,  t ] . Moreover,  the  time  optimal  control  u(t)  is  characterized 
by  the  formula 

u(t)  = sgn  -r|(t),  T^t)*  0 

where  ri(t)  is  real  analytic  for  t < t with  discrete  zeros  accumulating 


a neighborhood  of  the  origin  in  B . 

Proof.  Immediate,  for  R(T)  includes  all  we  B with  l|w||  <1  by 

virtue  of  (9.  48). 

Lemma  9.  3 . If_  Wj  e R(  t)  but  not  to  any  R(T)  for  T < t (so  that  t 
is,  indeed,  the  minimal  time  to  reach  w^  ) then  w^  belongs  to  the 
boundary  of  R(t)  in  the  Banach  space  B . 

Proof.  If  not,  then  w e Int(R(x))  in  B and  R(t)  includes  the  set 

N (w, ) = (w.  + w | II  w ||  < e } 

ell  d 

for  some  e > 0 . Then  there  is  some  r < 1 such  that  ~ wj  £ R(T)  and 

hence  — w,  is  reachable  in  time  t . Assuming  the  expansion  (9.47) 
r 1 
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for  w,,  - w,  e R(t)  means  there  is  some  u satisfying  (9.44)  on  [0,  t] 

1 r 1 

such  that 

T -X  (T-t)  j 

g k f e u(t)dt  = - nk,  k = 1,  2,  3,  . . . . 

Now  let  6 > 0 and  define 


w6  = £ \\ 
k = l 


t-6  -X  (r-6-t) 

v.  = g,  / e u(t+6)dt  . 

K tc  o 

It  is  clear  that  wr  <=  R(t-6)  if  it  belongs  to  B . To  show  the  latter  we 
0 

compute 


7 ^k  - \ 


T -X  (T-t)  T-  6 -X  (T-6-t) 

g f e u(t)dt  - g,  f e u(t+6)dt 

K 0 


-X  (r-t) 
e u(t)dt 


< 


30 


6 


-vt-6) 

e 


(9.  50) 


Now  (9.45),  (9.  46)  show  the  last  sum  to  be  finite  so  Wj  - w^  lies  in 

B and  thus,  as  we  have  noted,  in  R(t  - 6)  . It  is  also  clear,  from  (9-  50) 

that  we  can  make  ||—  w - w |'|  as  small  as  we  wish  by  taking  6 > 0 

r 1 6 B 

small.  Then  we  can  also  make  |lw  - rw  ||  as  small  as  we  wish. 

1 o B 

But,  since  r < 1,  rw  clearly  lies  in  R(  t - 6)  and  is  reached  with  the 

6 

control  ru(t+6)  which  satisfies  -1  < -r  <_  u(t+6)  r < 1 for  t « [0,  t -6]  . 
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1 


Using  controls  v(t)  with  -(1-r)  < v(t)  < 1-r,  t«  [0,  t-6  ],  we  conclude 
from  Lemma  9.  2 that  R(  t-6)  also  includes  the  set  {r  w&  + w | ||  w ||  < 1-r } . 

Taking  6 small  enough  so  that  ||w  - rw^  ||  < 1 - r we  have  w^  « R(t-6)  . 
This  is  contrary  to  our  assumption  that  t is  the  minimal  time  for  which 
Wj  e R(t)  . We  conclude  Wj  belongs  to  the  boundary  of  R(t)  in  B and 
the  proof  of  the  lemma  is  complete. 

* 

Lemma  9.4.  There  is  a continuous  linear  functional  r|  e B with  the 
representation 


or 

gfw)  = g(^ 
k=l 


°k  V 


V 


w 6 B 


(9.  51) 


such  that 


(i)  j t|^  I <_  K ^ , k = 1,  2,  3,  . . . for  some  K > 0 ; 


(ii)  r|  (wt)  > ri(w),  w € R(  t) 


Proof.  The  existence  of  t)  satisfying  (ii)  follows  from  Mazur's  separation 
theorem  ([17])  after  we  take  note  that  R(r)  is  convex  and,  from  Lemma  9.2, 
includes  a neighborhood  of  the  origin. 

Now  (9.48)  clearly  shows  that 

= 1,  k = 1,2,3,... 

B 


from  which  we  conclude 
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and  we  have  (i)  with  K = ||  ri  II  ^ 

b” 

We  can  now  complete  the 


Proof  of  Theorem  9.  1.  We  have 


°°  S -\c(T-t)  - 

w,  = V g ( | e u(t)dt)<p 

1 k=i  k " 0 


where  u is  a control  steering  0 to  w^  during  the  minimal  time  interval 
[0,  t ] . For  any  other  control  u satisfying  (9.  44)  on  [0,t]  we  have 


the  endpoint 


00  T -X  (T-t) 

= Yj  gw(  / e u(t)  dt) 

k=l  0 


If  w(T)  £ R(t),  Lemma  9.4  applies  to  give 

p(w  - w(t))  > 0 


(9.  52) 


for  some  g e B'  . Using  (9.  51)  this  becomes 
t oc  - X (T-t) 

f ^k  9k  6 )(u(t)  - u(t))dt  > 0 , 

0 k=l 

the  change  in  order  of  summation  and  integration  being  easily  justified 
from  (i)  of  Lemma  9.4.  That  property  of  the  r|k  also  shows  that 


ti(z)  = V t,  g e 


•Xk(T-Z) 


is  analytic  in  the  open  left  half  plane  Re(z)  < t . Hence  its  zeros  on  the 
positive  real  axis  are  discrete  with  no  accumulation  point  other  than 
z = 0 . 
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Consider  then  any  interval  ( t ^ , t f ) with  0 < t < t^  < T wherein 
r| ( t)  is  of  one  sign,  w.  1.  o.  g.  positive.  If  we  let 


u(t)  = 


u(t),  1 « (t^) 


(9-  93) 


t£  (VV 


then  u satisfies  (9.44)  on  [0,  t]  and,  since  t < t,  the  solution 


w(t)  - w(t)  of  (9.41),  (9.42)  corresponding  to  u(t)  - u(t)  satisfies 


.2, 


o(w  - w)  d (w  - w) 


at 


f r(x)(w  - w)  = 0 


9x 


for  t < t <_  t , This  last  fact  shows  quite  readily  that  w(t)  - w(t)  = w^  - w(t) 


lies  in  B and  hence,  since  we  know  w^  e B,  that  w(t)  c B as  required 


for  (9.  52).  But  (9.  52),  with  u given  by  (9.  53),  becomes 

t 


/ n(t)(u(t)  - I)dt  > o 


and,  since  we  have  assumed  r|(t)  >0  in  tj,t.^,  we  must  have  u(t)  = 1 , 
t € (tj,  t?)  . From  this  we  see  that  for  all  t e [0,  t)  with  r|(t)  * 0 we 
have 

u (t)  = sgn  r| ( t ) 


and  the  proof  is  complete. 


We  note  that  ^(t)  has  the  form  (y(- , t),  g)  where 

L [0,1] 

It-  + ~r  - r(x)  y = 0,  x € [0,  1],  t < t . 


9t  v2 

ax 


-201- 


In  general  y(-  , t)  <j  L^fO,  1 ],  however.  Vre  have  noted  that  the  zeros 

of  Ti(t)  are  discrete,  accumulating,  if  at  all,  only  at  t = t . Hence 

the  optimal  control  u (t)  assumes  the  values  ±1  on  an  at  most 

countable  sequence  {I.}  of  intervals  separated  by  "switching  times" 

t with  lim  t = t . Thus  u(t)  is,  indeed  a "bang-barg"  control, 
i 1 

1 — °° 

The  role  played  by  controllability  in  this  theory  is  very  clear. 

It  is  necessary  to  define  a space  of  reachable  states  whose  topology 
is  strong  enough  so  that  the  sets  R(t)  defined  above  have  interior 
points,  allowing  Mazur's  theorem  to  be  applied,  and  yet  also  weak 
enough  so  that  R(t)  moves  continuously  with  t,  permitting  one  to 
conclude  the  result  of  Lemma  9.  3.  These  conflicting  requirements  on 
the  topology  of  B require  that  a great  deal  be  known  about  the  reachable 
points  and  their  relationship  to  the  controls  which  achieve  them.  Similar 
observations  were  made  by  Quinn  in  [78]  in  connection  with  other  time 
optimal  control  problems  for  distributed  systems. 

We  note  in  conclusion  that  a numerical  procedure  for  determining 
the  switching  times  t.  described  above  has  been  proposed  by  Goldwyn 
Sriram  and  Graham  in  [32]  with  impressive  numerical  success.  The 
theoretical  basis  for  the  method  appears,  however,  to  require  some 
strengthening. 
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